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ABSTRACT 

We develop a model for the cosmological role of mergers in the evolution of starbursts, quasars, and 
spheroidal galaxies. By combining theoretically well-constrained halo and subhalo mass functions as a func- 
tion of redshift and environment with empirical halo occupation models, we can estimate where galaxies of 
given properties live at a particular epoch. This allows us to calculate, in an a priori cosmological manner, 
where major galaxy-galaxy mergers occur and what kinds of galaxies merge, at all redshifts. We compare this 
with the observed mass functions, clustering, fractions as a function of halo and galaxy mass, and small-scale 
environments of mergers, and show that this approach yields robust estimates in good agreement with obser- 
vations, and can be extended to predict detailed properties of mergers. Making the simple ansatz that major, 
gas-rich mergers cause quasar activity (but not strictly assuming they are the only triggering mechanism), we 
demonstrate that this model naturally reproduces the observed rise and fall of the quasar luminosity density 
from z = 0-6, as well as quasar luminosity functions, fractions, host galaxy colors, and clustering as a function 
of redshift and luminosity. The recent observed excess of quasar clustering on small scales at z ~ 0.2-2.5 is a 
natural prediction of our model, as mergers will preferentially occur in regions with excess small-scale galaxy 
overdensities. In fact, we demonstrate that quasar environments at all observed redshifts correspond closely 
to the empirically determined small group scale, where major mergers of ~ L„ gas-rich galaxies will be most 
efficient. We contrast this with a secular model in which quasar activity is driven by bars or other disk insta- 
bilities, and show that while these modes of fueling probably dominate the high-Eddington ratio population at 
Seyfert luminosities (significant at z = 0), the constraints from quasar clustering, observed pseudobulge popu- 
lations, and disk mass functions suggest that they are a small contributor to the z > 1 quasar luminosity density, 
which is dominated by massive BHs in predominantly classical spheroids formed in mergers. Similarly, low- 
luminosity Seyferts do not show a clustering excess on small scales, in agreement with the natural prediction of 
secular models, but bright quasars at all redshifts do so. We also compare recent observations of the colors of 
quasar host galaxies, and show that these correspond to the colors of recent merger remnants, in the transition 
region between the blue cloud and the red sequence, and are distinct from the colors of systems with observed 
bars or strong disk instabilities. Even the most extreme secular models, in which all bulge (and therefore BH) 
formation proceeds via disk instability, are forced to assume that this instability acts before the (dynamically 
inevitable) mergers, and therefore predict a history for the quasar luminosity density which is shifted to earlier 
times, in disagreement with observations. Our model provides a powerful means to predict the abundance and 
nature of mergers, and to contrast cosmologically motivated predictions of merger products such as starbursts 
andAGN. 

Subject headings: quasars: general — galaxies: active — galaxies: evolution — cosmology: theory 



1. INTRODUCTION 

1.1. Motivation 

Over the past decade, observations have established that 
supermassive black holes lik ely reside in the centers of all 
galaxies with spheroid s (e.g.. iKormendv & R ichstonej 1 19951: 
iRichstone etait Il998t IKormendv & Gebhardtl 12001b . and 
that the properties of these black holes and their hosts are 
correlated. These correlations take various forms, relating 
the b l ack hole mass to e.g. th e mass ( Magorri an et al.l 
19981 iMcLure & Dunlopl l2002t iMarconi & Huntl 120031: 
Haring & Rix 2 0041). v e locity d ispersion (iFerrarese & Merrittl 
2000t iGebhardt et alj l2000t iTremaine et al.l 120021) and 
concentration or Ser sic index dGraham etalT l200lt 
i Graham & Driyerl 120 06) of the spheroid. Most recently, 
Hopki ns"et al.l d2007d) have demonstrated that these relation- 
ships are not independent and can be understood as various 
projections of a black hole fundamental plane analogous to 
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the fundamental plane for ellipti cal galaxies dDressler et al.l 
Il987t iDiorgovski & Davisl 119871) . The striking similarity 
between these two fundamental planes indicates that galaxy 
spheroids and supermassive black holes are not formed 
independently, but originate via a common physical process. 

Furthermore, although there may be some relatively weak 
evolution in the correlation between BH mass and host 
mass or velocity dispersion owing to changes in spheroid 
struct ural properties and internal correlations with redshift 
(e.g. iPeng et al.ll2006t [Shields et al.ll20q3l l2006t IWalter et al. 
2004t [Salviander et al.ll2006l: IWoo et al.ll2006t iHopkins et al. 
2006e ), the fundam ental plane appears to be preserved 
(Hopk ins et al.l200~7cl) . and in any case some correlation exists 
at all redshifts. There are not, at any redshifts, bulgeless sys- 
tems with large black holes or bulges without correspondingly 
large black holes. This empirically demonstrates that what- 
ever process builds up black hole mass must trace the forma- 
tion of spheroids (albeit with potentially redshift-dependent 
efficiency). 

These connections extend to other phenomena associ- 
ated with galaxies that have sometimes been interpreted as 
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being independent. For examp le, by esti mating the to- 
tal energy radiated by quasars, ISoltanl (119821) showed that 
nearly all the mass in supermassive black holes must have 
been accumulated during periods of bright quasar activ- 
ity. This analy sis has since been revisited on a num- 



ber of occasions (ISalucci et alJll999b lYu & Tremainel 12 002: 
IMarconi et al.l l2004t IShankar et alJ l2004t lYu & Lul l200¥ . 
with various ass umptions for quasar ob scuration and bolomet- 



ric corrections. Hopkins et al 



( 2007e) have reformulated the 
Soltan argument from the evolution of the bolometric quasar 
luminosity function (LF). In their analysis, Hopkins et al. 
combined observations of the quasar LF in a variety of wave- 
bands with purely empirical determinations of the luminos- 
ity dependence of quasar obscuration and spectral emission 
to infer the bolometric quasar LF. By integrating this over lu- 
minosity and redshift, it is then possible to obtain a model- 
independent estimate of the total energy density of radiation 
from quasars. The cosmic black hole mass density then fol- 
lows if black hol es in quasars accrete with constant radia- 
tive efficiency e r (IShakura & Sunyaevlll973l) . by integrating 
Lboi = c r AfgH c 2 . This yields a z = black hole mass density of 



/; B1 ,(; =0)=4.8l!^( — 



h% x 10 5 M Q Mpc" 



(1) 



consistent with estimates of pbh(z = 0) obtained from lo- 
cal bul ge mass, luminosity, and velocity dispe rsion functions 
(e.g.. IMarconi et al.ll200H IShankar et al.ll20M) . 

Taken together, the black hole fundamental plane and 
the Soltan argument imply that the common physical pro- 
cess which produces galaxy spheroids and supermassive 
black holes also must be responsible for triggering most 
bright quasars. Moreover, there is compelling evidence 
that quasar activity is preceded by a period of intense 
star formation in galaxy centers so that, for example, ul- 
traluminous infrared galaxies (ULIRGs) and distant sub- 
mi llimetergjdjixies_J^MGs]_wouU 

quasars (|Sanders et aD Il988ald: ISanders & Mirabel [1991 
iDasyra et al .1120 06b). Essentially all sufficiently deep stud- 
ies of the spectral energy distributions (SEDs) of quasar 
host galaxies reveal the presence of yo ung stellar pop- 



19995: ICanalizo & Stockton! 


20011: 


Kauffmann et al.1 u003; 


Yipetal. 2004; Jahnkeetal. 


2004a 


b; ISanchez et al.M20o4 


Vanden Berk et al.l 120061 iBarthell 
2006). There further appears to be a 
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that the most l uminous quasars have the youngest host stel- 
lar populations dJahnke et al.l2004al:rVanden Berk et al.l2006h 
and the greatest prominence of post-merge r tidal features and 
distur bances jC analizo & Stockto n! 120011: iKauffmann et al 



2003; Hutchings et al. 2003; Hutchings 2003; Hutchings et al 
2006; Zak amska et al.1 120061: iLetawe et al.l l2006). These ob- 
servations indicate that intense starbursts must result from the 
same process as most quasars and supermassive black holes. 

In the simplest interpretation, we seek an explanation for 
the various phenomena summarized above such that they re- 
sult from the same event. There are general, theoretical re- 
quirements that any such event must satisfy. In particular, it 
must be fast and violent, blend together gas and stellar dynam- 
ics appropriately, and involve a supply of mass comparable to 
that in large galaxies. Why should this be the case? 

The accepted picture for the growth of supermassive black 
holes is that the mas s is primarily assembled by gas accretion 
dLvnden-Bell| [T969). Fr om the Soltan argument, we know 
that this mass must be gathered in a time comparable to the 



lifetim es of bright quasars, which is similar to the Salpeter 
(1964) time - 10 7 5 years, for bla ck holes accre ting at the 
Eddington rate. Independent limits (Martini 2004, and refer- 
ences therein) from quasar clustering, variability, luminosity 
function evolution, and other methods demand a total quasar 
lifetime (i.e. duration of major growth for a given BH) of 
< 10 8 5 yr. In order to explain the existence of black holes 
with masses ~ 10 9 M Q , the amount of gas required is likely 
comparable to that contained in entire large galaxies. Thus, 
the process we seek must be able to deliver a galaxy's worth 
of gas to the inner regions of a galaxy on a relatively short 
timescale, <C 10 9 years. 

If this event is to simultaneously build galaxy spheroids, it 
must involve stellar dynamics acting on a supply of stars sim- 
ilar to that in large galaxies because the stellar mass is ~ 1000 
times larger than that of the black hole and it is believed that 
spheroids are assembled mainly (albeit not entirely) through 
dissipationless physics (i.e. the movement of stars from a cir- 
cular disk to random spheroid orbi ts). A plausible ca ndidate 
process is violent relaxation (e.g. Lyn den-Belll fT967) which 
has been demonstrated to yield phase space distributions akin 
to those of elliptical galaxies through large, rapid fluctuations 
in the gravitational potential. Violent relaxation operates on a 
timescale similar to the free-fall time for self-gravitating sys- 
tems, again <C 10 9 years for the bulk of the mass. 

Motivated by these considerations, Hopki ns et al.l ((2006a) 
developed a model where starbursts, quasars, supermas- 
sive black hole growth, and the formation of red, ellipti- 
cal galaxies are connected through an evolutionary sequence, 
caused by mergers between gas-rich galaxies. There is, 
in fact, considerable observational evidence indicating that 
mergers are responsible for triggering ULIRGs, SMGs, and 
quasars ( see references in Hopkin s et al. 2006a; f or re- 
views see iBarnes & Hernquistlll992t ISchweized[l998l: Uoged 
2004). Furthermore, the long-standing "merger hypothesis," 
whi ch proposes that most elliptical galaxies f ormed in merg- 
ers (iToomre & Toomrd 119721: iToomrd Il977l) . is supported 
by the structure of known ongoin g mergers (e.g., ISchweizerl 
119921: iRothberg & Joseph! l2006allbl and the ubiquitous pres- 
ence of fine structures such as shells, ripples, tidal plumes, 
nucl ear light excesses, and kinematic subsystems i n ellipticals 
(e.g. [Schweizer & SeitzeJll992tlSchweizeril 19961). which are 
signa t ures of mergers (e.g. lOu inn 1984; Hernauist & Ouinnl 



19871: iHernauist & Spergellll992t iHernquist & Barneslll991 
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Numerical simulations performed during the past twenty 
years verify that major mergers of gas-rich disk galaxies 
can plausibly account for these phenomena and have eluci- 
dated the underlying physics. Tidal torques excited during a 
mer ger lead to rapid inflows of gas into the cen t ers of galax- 
ies EEnauiEIiall |lirneI&Herna^ [Till [1996). The 
amount of gas involved can be a large fraction of that in 
the progenitor galaxies and is accumulated on rou ghly a dy- 
namic al time in the inner regions, <C 10 9 years (Hernquist 
1989). The resulting high gas densities trigger starbursts 
(Miho s & Hernquist|[l994bl,ll996l) . and feed rapid black hole 
growth (Di Mat teo et al.l l2005). Gas consumption by the star- 
burst and dispersal of residual gas by su pernova-driven winds 
and feedback from black hole growth ( Spri nge! et alj [2005a) 
terminate star formation so that the remnant quickly evolves 
from a blue to a red galaxy. The stellar component of the pro- 
genitors provides the bulk of the material for producing the 
remnant spheroid dBarnesll 1 98811 1 992t [Hernquist! 1 992l 1 1 993h 
through violent relaxation. 
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The simulations also place significant constraints on the 
types of mergers that can initiate this sequence of events. 
First, a major merger is generally required in order for the 
tidal forces to excite a sufficiently strong response to set up 
nuclear inflows of gas. Although simulations involving minor 
mergers with mass ratios ~ 10 : 1 show that gas inflows can 
be excited under some circumstances (e.g. lHernquist|[T989l: 
iHernquist & Mihosl 19951; iBournaud et al.l2005l) . a systematic 
study indicates t hat such an outcome is limited to specific or- 
bital geometries (Younger et al. 2007) and that the overall ef- 
ficiency of triggering inflows declines rapidly with increas- 
ing mass ratio. Thus, while the precise definition of a ma- 
jor merger in this context is blurred by the degeneracy be- 
tween the mass ratio of the progenitors and the orbit of the 
interaction, it appears that a mass ratio ~ 3 : 1 or smaller 
is n eeded. This is further supported by ob servational stud- 
ies (iDasvra et al.ll2006at IWoods et al.ll2006l) . which find that 
strong gas inflows and nuclear starbursts are typically seen 
only below these mass ratios, despite the much greater fre- 
quency of higher mass-ratio mergers. 

Second, the merging galaxies must contain a supply of cold 
gas, which in this context refers to gas that is rotationally sup- 
ported, in order that the resonant response leading to nuclear 
inflows of gas in a merger be excited. Elliptical galaxies con- 
tain large quantities of hot, thermally supported gas, but even 
major mergers between two such objects will not drive the 
nuclear inflows of gas that fuel rapid black hole growth. 

It also must be emphasized that essentially all numeri- 
cal studies of spheroid kinematics find that only mergers 
can reproduce the observed kinemat ic properties of ellipti- 
cal galaxies and "classical" bulges dHernquistl 1 19891 ] 19921 
1993MBarnesll 19881 1 1992tlSchweizedll 9921: iNaab et alj|l999t 
Naab & Burkertl 120031: iNaab et al.l 2006a b; Naa b & Tmiiiiol 
2006t IBournaud et al.1 120051 Uesseit et al.1 l2006t ICoxetalJ 
2006b). Disk instabilities and secular evolution (e.g. bar 
instabilities, harassment, and other isolated modes) can in- 
deed prod uce bul ges, but these are in variabl y "pseudobulges" 
dSchwarzl 119811: lAthanassoula et al.1 [19831: iPfennigeri Il984t 



Combes et alj 119901: iRaha et al l 119911: iKuiiken & Merrifieldl 



1995; O'Neill & Dubinski 2003 



lAthanassoula! l2005h . with 
clearly distinct shapes (e.g. flattened or "peanut"-shaped 
isophotes), rotation properties (large v/a), internal correla- 
tions (obeying different Kormendy and Faber-Jackson rela- 
tions), light profiles (nearly exponential Sersic profiles), and 
colors an d/or substructure from classic al bulges (for a re- 
view, see iKormendv & Kennicuttl l2004). Observations indi- 
cate that pseudobulges constitute only a sm all fraction of the 
total mass densit y in spheroids (< 1 0%; see lAllen et all2006t 
iBall et al.1 120061: Driver et al.1 120071) . becoming a large frac- 
tion of the bulge population only for small bulges in late-type 
hosts (e.g. Sb/c, corresponding to typical Mr H 5, 1 7 M^; see 
ICarollo et all 1 19981: IKormendv & Kennicutt)[2004 and refer- 
ences therein). Therefore, it is clear that although such pro- 
cesses may be important for the buildup of the smallest black 
hole and spheroid populations, secular evolution cannot be the 
agent responsible for the formation of most elliptical galaxies, 
or for the buildup of most black hole mass, or the triggering 
of bright quasar activity. 

We are thus led to s uggest a genera lization of the merger hy- 
pothesis proposed by Toomre (1 19771) whereby major mergers 
of gas-rich disk galaxies represent the dominant process for 
producing the supermassive black hole and spheroid popula- 
tions in the Universe. Then, by the Soltan argument and the 
association of starbursts with quasars, it follows that this must 



also be the primary mechanism for triggering the most intense 
infrared luminous galaxies and the brightest quasars and ac- 
tive galactic nuclei (AGN). It is important to keep in mind that 
this does not rule out other processes occurring at lower lev- 
els and under other circumstances. For example, we are not 
claiming that all AGN result from mergers. In fact, low levels 
of such activity, as in Seyfert galaxies, often appear in undis- 
turbed galaxies. For these objects, other modes of fueling are 
li kely more significant, as in th e stochastic accretion scenario 
of lHopkins & Hernqu ist (2006). The primary requirement on 
our model is that the bulk of the supermassive black hole mass 
density should have accumulated through gas-rich mergers, 
consis tent with the redshi ft evolution of the quasar popula- 
tion (iHopkins et al.ll20~07el) . Similarly, spheroid evolution by 
gas-free ("dry") mergers will go on, but does not explain how 
stellar mass is initially moved onto the red sequence or how 
black hole mass is initially accreted. 

1.2. Outline 

To test our hypothesis, we have developed methods for fol- 
lowing the growth of black holes in numerical simulations of 
galaxy mergers, using a multiphase model for the star-forming 
gas that enables us to consider progenitor disks with large gas 
fractions. Generically, we find that major mergers of gas-rich 
galaxies evolve through distinct phases that can plausibly be 
identified with the various observed phenomena summarized 
above. 

Figure Q] presents a schematic outline of these phases. In 
this picture, galactic disks grow mainly in quiescence, with 
the possibility of secular-driven bar or pseudobulge forma- 
tion, until the onset of a major merger. A significant, per- 
haps even dominant fraction of Seyferts and low-luminosity 
quasars will almost certainly arise from this secular evolu- 
tion, but the prevalence of pseudobulges only in the hosts of 
< 10 7 M© black holes suggests this is limited to luminosities 
Mb > -23 (see the discussion in § 13. 3b . 

During the early stages of the merger, tidal torques excite 
some enhanced star formation and black hole accretion, but 
the effect is relatively weak, and the combination of large 
galactic dust columns and relatively small nuclear black holes 
means that only in rare circumstances (involving particular 
initial orbits and/or bulge-to-disk ratios) will the pair be iden- 
tified as Seyferts or quasars. Most observationally identi- 
fied mergers (and essentially all merging pairs) will be in 
this stage, and numerical simulations suggest it is the last 
stage at which the distinct nuclei enable automated morpho- 
logical sel ection criteria to efficie ntly identify the system as 
a merger (lLotz et all 120041 12007). Care must therefore be 
taken with conclusions regarding the prevalence of starbursts 
and AGN in these samples, as the smal l observed incidence 
of quasar activity (iDasvra et a l. 2006a; My ers et aD 12006b: 
IStraughn et al.ll2006llAlonso et al.ll2007l) is actually expected. 

During the final coalescence of the galaxies, massive in- 
flows of gas trigger starbursts with strengths similar to those 
inferred for ULIRGs and SMGs, although the actual mass 
in stars formed in these bursts is generally small compared 
to the stellar mass contributed by the merging disks. The 
high gas densities feed rapid black hole growth, but the 
black holes are obscured at optical wavelengths by gas and 
dust and are initially small compared to the newly form- 
ing spheroid. However, by the final stages, high accretion 
rate, heavily obscured (and in some cases nearly Compton- 
thick) BH growth in a ULIRG stag e (often with merg- 
ing binary BHs) appears ubiquitous dKomossa et al.l 120031: 



(c) lnteraction/"Merger" 



(d) Coalescence/(U)URG 



(e) "Blowout" 



(f) Quasar 




now within one halo, galaxies interact & 

lose angular momentum 
SFR starts to increase 
stellar winds dominate feedback 
rarely excite QSOs (only special orbits) 

(b) "Small Group" 



■ galaxies coalesce: violent relaxation in core 

■ gas inflows to center: 

starburst & buried (X-ray) AGN 
• starburst dominates luminosity/feedback, 
but, total stellar mass formed is small 





■ halo accretes similar-mass 

companion(s) 

■ can occur over a wide mass range 

■ Mhaio still similar to before: 

dynamical friction merges 
the subhalos efficiently 



(a) Isolated Disk 



m 



halo & disk grow, most stars formed 
secular growth builds bars & pseudobulges 
"Seyfert" fueling (AGN with Mb>-23) 
cannot redden to the red sequence 



- BH grows rapidly: briefly 

dominates luminosity/feedback 

- remaining dust/gas expelled 

- get reddened (but notType II) QSO: 

recentiongoing SF in host 
high Eddington ratios 
merger signatures still visible 



-1 1 

Time (Relative to Merger) [Gyr] 



dust removed: now a "traditional" QSO 
host morphology diff cult to observe: 

tidal features fade rapidly 
characteristically blue/young spheroid 



(g) Decay/K+A 




QSO luminosity fades rapidly 

- tidal features visible only with 
very deep observations 

remnant reddens rapidly (E+A/K+A) 
"hot halo" from feedback 

- sets up quasi-static cooling 



(h)"Dead" Elliptical 




■ star formation terminated 

■ large BH/spheroid - efficient feedback 

■ halo grows to "large group" scales: 

mergers become ineffl cient 

■ growth by "dry" mergers 



FIG . 1 . — An schematic outline of the phases of growth in a "typical" galaxy undergoing a gas-rich major merger. Image Credit: (a) NOAO/AURA/NSF; (b) REU program/NOAO/AURA/NSF; (c) NASA/STScI/ACS 
Science Team; (d) Optical (left): NASA/STScI/R. P. van der Marel & J. Gerssen; X-ray (right): NASA/CXC/MPE/S. Komossa et al.; (e) Left: J. Bahcall/M. Disney/NASA; Right: Gemini Observatory/NSF/University 
of Hawaii Institute for Astronomy; (f) J. Bahcall/M. Disney/NASA; (g) F. Schweizer (CIW/DTM); (h) NOAO/AURA/NSF. 
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lAlexander etaill2005at iBorvs et al.ll2005t iBrand et al.ll2006h . 
and by high redshifts (z ~ 2) may dominate the obscured lumi- 
nous quasar population ([Alexander et al. 2005b; St evens et alj 
120051: iMarffnez-Sansigre et al.ll2006tlBrand et al.ll2007l) . 

Most of the nuclear gas is consumed by the starburst 
and eventually feedback from supernovae and the black 
hole begins to disperse the residual gas. This brief tran- 
sition or "blowout" phase will be particularly associated 
with highly dust-reddened (as opposed to more highly ob- 
scured Type II) and/or IR-luminous quasars. As a relatively 
short phase, such objects constitute only ~ 20 - 40% of the 
quasar population similar to that observed dGregg et al 120021; 
White et aUl2003URichards et al.ll2003l l2006at iHopkins et al l 



2004). In fact, observational studies find that red quasar 
populations are related to mergers, with > 75% (and as 
high as 100%) showing clear evidence of recent/ongoing 
merging dHutchings et atll2003l 120061: iKawakatu et al.ll2006t 
iGuvon et all l2006t lUrrutia et al.l 120071). with young post- 
starburst stellar populations dGuyon et al.ll2006l) . much of the 
dust arising on scal es of the galaxy (in turbulent motions, in- 
flow, and outflow; lUrrutia et al.l 120071) . and extremely high 
Eddington ratios indicative of a still active period - making 
them (as opposed to most fully obscured quasars) a substan- 
tial contributor to the most luminous quasar s in the Universe 
(Whit e et all 120031; iHutchings etail 120061: IZakamskaet ail 
2006). As the dust is removed, the black hole is then visi- 
ble as a traditional optical quasar (although very small-scale 
"torus" obscuring structures may remain intact, allowing for 
some rare, bright Type II systems). 

Here, observati ons of the host morphology are more am- 
biguous (see e.g. i Bahcall et al.l Il997t ICanalizo & Stockton 
200 It iFlovd et all 120041; IZakamska et al l 120061; iPierce et all 
2006), but this is expected, for two reasons. First, the point 
spread function of the bright and unobscured optical quasar 
must be subtracted and host galaxy structure recovered, a dif- 
ficult procedure. Second, by this time the merger is com- 
plete and the spheroid has formed, leaving only fading tidal 
tails as evidence for the recent m erger. Mock obser vations 
constructed from the simulations (Kra use et alj 12007) imply 
that, with the best presently attainable data, these features are 
difficult to observe even locally and (for now) nearly impos- 
sible to identify at the redshifts o f greatest interest (z > 1). 
This appears to be borne out, as iBennert et alj (120071) have 
re-examined low-redshift quasars previously recognized from 
deep HST imaging as having relaxed spheroid hosts, and 
found (after considerably deeper integrations) that every such 
object shows clear evidence for a recent merger. These dif- 
ficulties will lead us to consider a number of less direct, but 
more robust tests of the possible association between mergers 
and quasars. 

Finally, as the remnant relaxes, star formation and quasar 
activity decline as the gas is consumed and dispersed, and 
the remaining galaxy resembles an elliptical with a quiescent 
black hole satisfying observed correlations between black 
hole and spheroid properties. During this intermediate ^Gyr 
decay, depending on details of the merger and exact viewing 
time, the remnant may be classified as a low-luminosity (de- 
caying) AGN in a massive (and relatively young) spheroid, or 
as a post-starburst (E+A/K+A) galaxy. Observationally, the 
link between K+A galaxies and mergers is well- established 
(e.g. lYang et aLll2004t lGotoll2005l; IMogg et al.ll2006l and ref- 
erences therein), and there is a clear tendenc y for these galax- 
ies to host l ow-luminosity AGN or LINERs dYang et al.fl2006l: 
lGoto1l2006h . A gain, for the reasons given above, the situation 



is less clear for all low-luminosity AGN (and there will be, 
as noted above, many such sources driven by secular mech- 
anisms in disks). But more importantly most objects seen 
in this stage are expected to have relaxed to resemble nor- 
mal spheroids. The merger exhausts gas and star formation 
in an immediate sense very efficiently, so the remnant red- 
dens rapidly onto the red sequence. If this is also associated 
with quenching of future star formation (see Paper II), then the 
spheroid will evolve passively, growing largely by dry merg- 
ers. 

Individual simulations of mergers have enabled us to quan- 
tify the duration of these stages of evolution and how this 
depends on properties of the merging galaxies, such as their 
masses and gas content and the mass ratio and orbit of the en- 
counter. In particular, we used the res ults to suggest a phys - 
ical interpretation of quas ar lifetimes ([H opkins et al. 2005d), 
to examine how quasars 

(Hopkins_eiaLl2005a) 

and starbursts 

dChakr abarti et al] |2007l) would evolve in this scenario, and 
quantify structural properties of the remnant and how they 
depend on e.g. t he gas fractions of the m e rging galaxies 
dCox et al.l l2006allbl: iRobertson et al.l I2006bll3: IHopkins etail 
I2007d) . 

In addition to making predictions for individual systems, 
we would also like to characterize how entire populations of 
objects would evolve cosmologically in our picture to test the 
model against the large body of observational data that exists 
from surveys of galaxies, quasars, and starbursts. Previously, 
we have adopted a semi-empirical approach to this problem, 
as follows. In our simulations, we can label the outcome by 
the final black hole mass in the remnant, Mbhj or, equiva- 
lently, the peak bolometric luminosity of the quasar, h pe ak- 
Our simulations predict a regular behavior for the evolution of 
the different merger phases as a function of Mbhj or L pea k and 
also for the properties of the remnant as a function of Mbhj 
or L pea k. If we have an estimate of the observed distribution 
of systems in one phase of the evolution, we can then use our 
models to deconvolve the observations to infer the implied 
birthrate of such objects as a function of Mbhj or L pet & . Given 
this, the time behavior of the simulations provides a mapping 
between the different phases enabling us to make indepen- 
dent predictions for other populations. For example, knowing 
the observed quasar luminosity function (QLF) at some red- 
shift, our simulations allow us to predict how many quasar- 
producing mergers of a given mass must be occurring at the 
time, which can then be tested against the observed merger 
statistics. 

We exploited this approach to examine the relationship be- 
tween the abundance of quasars and other manifestations of 
quasar activity, and showed that our model for quasar life- 
times and li ghtcurves yields a me ans to interpret the shape 
of the QLF (Hopki ns et alJl2005bh . provides a consistent ex- 
planation fo r observations of the QLF at optical and X-ray 
frequencies (Hopki ns et al.ll2005cl) . ex plains observed evolu - 
tion in the faint-end slope of the QLF dHopkins et al.l l2006b). 
and can acc ount for the spectral shape of the cosmic X-ray 
background (Hopkin s et al.l|20 06a, 2007e). Using this tech- 
nique to map between different types of objects, we demon- 
strated that the observed evolution and clustering of the quasar 
population is consistent with observ ations of red galaxies 
dHopkins etail 120064 l 2007 4 l2006dl) and merging systems 
(Hopkins et al. 2007a, 2006f), as well as the mass function 



of super massive black holes and its e stimated evolution with 
redshift dHopkins et al.l20 06a, 2007 e). In each case, we found 
good agreement with observations provided that the mappings 



6 



Hopkins et al. 



were based on the lifetimes and lightcurves from our merger 
simulations and not idealized ones that have typically been 
used in earlier theoretical studies. We fu rther showed that our 
picture makes numerous predictions dHopkins et alj l2007al 
2006a) that can be used to test our hypot hesis, such as th e 
luminosity dependence of quasar clustering dLidz et al.l2006l) . 
However, the cosmological context of our results was not pro- 
vided in an entirely theoretical manner because our analysis 
relied on an empirical estimate of one of the connected popu- 
lations. 

Obtaining a purely theoretical framework for our scenario 
is difficult because cosmological simulations including gas 
dynamics currently lack the resolution to describe the small- 
scale physics associated with disk formation, galaxy mergers, 
star formation, and black hole growth. Semi-analytic methods 
avoid some of these limitations, but at the expense of parame- 
terizing the unresolved physics in a manner this is difficult to 
calibrate independently of observational constraints. For the 
time being, neither approach is capable of making an entirely 
ab initio prediction for how the various populations we are 
attempting to model would evolve with time. 

In this paper, we describe a strategy that enables us, for 
the first time, to provide a purely theoretical framework 
for our picture. Our procedure is motivated by, but does 
not rely upon, observations suggesting that there is a char- 
acteristic halo mass hosting bright quasars. This infer- 
ence follows from measurements of the clustering of quasars 
in the 2dF, SPSS, and ot h er surveys JPorciani et al.l l2004t 
Porciani & Norberd Eppl IWake et all 120 41: ICroom et al. 
20051 : ICoil et alJ l2007t iMyers et al.l l2006at Ida Angela et al. 
20061; IShen et al. 2007) a nd investigations of the quasar prox- 
imity effect dFaucher-Giguere et alj20()7tlKim & Croftll2007l: 
iNascim ento Guimar aes et al.l I2007L By adopting simple 
models for the merger efficiency of galaxies as a function of 
environment and mass ratio, we show that this characteristic 
halo mass for quasars corresponds to the most favorable en- 
vironment for major mergers between gas-rich disks to occur, 
namely the "small group" scale. This finding argues for an in- 
timate link between such mergers and the triggering of quasar 
activity and naturally leads to a method for determining the 
redshift evolution of the quasar population from dark matter 
simulations of structure formation in a ACDM Universe. 

By combining previous estimates of the evolution of the 
halo mass function with halo occupation models and our esti- 
mates for merger timescales, we infer the statistics of mergers 
that excite quasar activity. We then graft onto this our mod- 
eling of quasar lightcurves and lifetimes, obtained from our 
simulations of galaxy mergers that include star formation and 
black hole growth to deduce, in an ab initio manner, the red- 
shift dependent birthrate of quasars as a function of their peak 
luminosities and the corresponding formation rate of black 
holes as a function of mass. Because our merger simulations 
relate starbursts, quasars, and red galaxies as different phases 
of the same events, we can then determine the cosmological 
formation rate of these various populations and their evolu- 
tion with redshift. In particular, as we demonstrate in what 
follows, the observed abundance of all these objects is well- 
matched to our estimates, unlike for other theoretical mod- 
els, supporting our interpretation that mergers between gas- 
rich galaxies represent the dominant production mechanism 
for quasars, intense starbursts, supermassive black holes, and 
elliptical galaxies. 

We investigate this in a pair of companion papers. Here 
(Paper I), we describe our model and use it to investigate the 



properties of mergers and merger-driven q uasar activity. In 
the companion paper ( Hopkins et al. 2007b, henceforth Paper 
II), we extend our study to the properties of merger remnants 
and the formation of the early-type galaxy population. Specif- 
ically, § [2] outlines our methodology, describing the physi- 
cal criteria for and identification of major mergers (§ 12.11 ). 
the distribution of mergers across different scales and galaxy 
types (§ 12.21) . and the dependence of mergers on environmen- 
tal properties (§ 12.31 ). We then examine the predicted merger 
mass functions, fractions, and clustering properties from this 
model, and compare with observations to verify that we are 
appropriately modeling the merger history of the Universe 
(§ 12. 4b . In § [3] we examine the consequences of a general 
model in which mergers trigger quasar activity. We present a 
number of robust predictions both independent of (§ 13.11 ) and 
including (§ 13. 2\ physical models for the quasar lightcurves 
and duty cycles in mergers. We contrast this with a "secular" 
model in which quasar activity is caused by disk instabilities 
(§ 13.31 ). and show that a variety of independent constraints 
suggest that such a mode cannot dominate the formation of 
bright, high redshift quasars. We discuss and summarize our 
conclusions in §[4] 

Throughout, we adopt a WMAP3 {VLm, ^ Ai h, erg, n s ) = 
(0.26 8, 0.732, 0.704, 0.776, 0.947) cosmology dSoergel et al] 
2006), and normalize all observations and models shown to 
these parameters. Although the exact choice of cosmology 
may systematically shift the inferred bias and halo masses 
(primarily scaling with cs), our comparisons (i.e. relative bi- 
ases) are for the most part unchanged, and repeating our cal- 
culations for a "concordance" (0.3, 0.7, 0.7, 0.9, 1.0) cosmol- 
ogy or the WMAP 1 (0.27,0.73,0.71,0.84,0.96) results of 
Sperg eTet alJ d2003l) has little effect on o ur conclus i ons. W e 
also adopt a diet Salpeter IMF following [Bell et al J d2003bl) . 
and convert all stellar masses and mass-to-light ratios accord- 
ingly. Again, the choice of the IMF systematically shifts the 
normalization of stellar masses herein, but does not substan- 
tially change our comparisons. UBV magnitudes are in the 
Vega system, and SDSS ugriz magnitudes are AB. 

2. MERGERS 

2.1. What Determines Whether Galaxies Merge 

2.1.1. Physical processes 

To begin, we postulate which mergers are relevant to our 
picture. Minor mergers (mass ratios ^> 3 : 1) will not trigger 
significant star formation or quasar activity for most orbits, 
and consequently will neither exhaust a large fraction of the 
larger galaxy's gas supply nor be typically identified as merg- 
ers observationally. We are therefore specifically interested in 
major mergers, with mass ratios < 3 : 1, but note that our con- 
clusions are unchanged if, instead of this simple threshold, we 
include all mergers and adopt some mass-ratio dependent ef- 
ficiency (e.g. assuming the fractional BH/bulge growth scales 
with mass ratio R in some powe r-law fashion, oc R~ l , as sug- 
gested by numerical simulations; Younger et al. 2007). In this 
case, the decreasing efficiency of BH fueling in minor merg- 
ers leads (as expected) to the conclusion that they are only 
important at low masses/luminosities (similar to where secu- 
lar activity may dominate quasar populations; see § 13.31) . and 
our predictions for massive bulges and BHs are largely unaf- 
fected. If the timescale for two galaxies to merge is long com- 
pared to the Hubble time, they clearly will not have merged 
in the actual Universe. However, the merger timescale must 
also be short compared to the time required to tidally strip or 
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disrupt either of the galaxies - if it is not, then by the time the 
galaxies finally coalesce, the end result will simply be tidal 
accretion of material at large radii. 

This defines two fundamental criteria for galaxy mergers to 
occur in the setting of a halo of mass Mh a i : 

• The halo must host at least two galaxies of comparable 
mass ~ Mgai. Note that even for mergers of distinct host 
halos in the field, the halo-halo merger proceeds much 
faster than the merger of the galaxies, so there is some 
period where the two can be considered distinct sub- 
structures or distinct galaxies within a common host. 

• The merger must be efficient - i.e. occur in much less 
than a Hubble time. This requires that the mass of the 
galaxies and their associated (bound) dark matter sub- 
halos be comparable to the mass of the parent halo (e.g. 
for the simplest dynamical friction arguments, requir- 
ing M ha . /M gal < 30). 

Together, these criteria naturally define a preferred mass 
scale for major mergers (host halo mass Mhaio) for galaxies 
of mass M ga i. A halo of mass (Mh a i )(M ga i) typically hosts 
a galaxy of mass M ga i. At smaller (relative) halo masses 
^haio (Mh a io), the probability that the halo hosts a galaxy as 
large as M ga i declines rapidly (and eventually must be zero or 
else violate limits from the cosmic baryon fraction). At larger 
Mhaio 3> (Mhaio)i the probability that the halo will merge with 
or accrete another halo hosting a comparable ~ M ga i galaxy 
increases, but the efficiency of the merger of these galaxies 
declines rapidly. Eventually the M ga i galaxies are relatively 
small satellites in a large parent halo of mass Mh a i 3> (Mh a i ), 
for which (satellite-satellite) mergers are extremely inefficient 
(given the high virial velocities of the host, and dynamical 
friction timescales 3> tn). 

The preferred major-merger scale for galaxies of mass M„. d \ 
is therefore only slightly larger (factor ~ 2) than the average 
host halo mass for galaxies of this mass. We refer to this as 
the small group scale, and emphasize the term small in this 
name: the average halo of this mass still hosts only 1 galaxy 
of mass ~Mgai, and the identifiable groups will only consist of 
2-3 members of similar mass (although there may of course 
be several much smaller systems in the group, which have lit- 
tle dynamical effect). This is very different from large group 
scales, easily identified observationally, which consist of 3> 3 
members. 

Figure |2] illustrates several of these points. We adopt the 
merger ti mescales derived below and use the halo occupation 
fits from Wang et al. (2006) to determine the probability of a 
halo hosting a pair of galaxies of a given mass: the details of 
the formalism are described below and used throughout, but 
we wish to illustrate the key qualitative points. The merger 
timescale for galaxies of a given mass is shortest when they 
are large relative to their host halo mass, as expected from 
dynamical friction considerations. However, the probability 
of a pair being hosted cuts off sharply at low halo masses. 
Moreover, the contribution to mergers of galaxies of mass 
M ga i from larger halos is further suppressed by the simple fact 
that there are fewer halos of larger masses. 

Modern, high- resolution dark matte r-only cosmological 
simulations (e.g. ISpringel et al.l l2005d) have made it pos- 
sible to track the merger histories of galaxy halos over 
large ranges in cosmic time and halo mass. For our pur- 
poses, the critical information is contained in the subhalo 
mass function, which has been quantified in great detail di- 
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FIG. 2. — Efficiency of major galaxy mergers (of a certain galaxy mass 
relative to the characteristic local Schechter-function M*) as a function of 
host halo mass (at z = 0, but the results are qualitatively similar at all red- 
shifts). Top: Merger timescale relative to the Hubble time (assuming a pair 
of galaxies of mass M ga i are hosted in a halo of mass M|, a i ) - mergers occur 
rapidly (fmerger <C ih) when the halo mass is small relative to the galaxy mass 
(we temporarily ignore the obvious requirement that M ea i < /baryon ^halo)- 
Middle: Same, but now multiplied by the probability that the halo actually 
hosts a pair of galaxies of the given mass (technically, within a mass rati o 
3 : 1), given the empirical halo occupation model from Wane et al. 1 2006). 
Although mergers are most rapid in the lowest-mass halos, these halos do 
not host relatively massive galaxies. Bottom: Same, but further multiplied by 
the abundance of halos of a given mass - the fact that the halo mass func- 
tion and merger efficiency are decreasing functions of Mh a i (for fixed M ga i) 
means that the contribution to galaxy mergers of a given M ga i will be domi- 
nated by the lowest-mass halos in which there is a significant probability to 
accrete/host a pair of M ga i galaxies - the small group scale. 



rectly from such simulat ions dKravtsov et alj|2004t iGao et all 
l2004t iNurmi et al. 2006) and from ext ended Press-Schechter 
theory and se mi-anal ytic approaches dTavlor & Babull l2004t 
IZentner et alj [2005; Ivan den Bosch et al.l 120051) calibrated 
against numerical simulations. 

When a halo (containing a galaxy and its own subhalo pop- 
ulations) is accreted, the accretion process is relatively rapid 
- the accreted halo will always be identifiable for some period 
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of time as a substructure in the larger halo. Although the new 
subhalo may lose mass to tidal stripping, there will still be 
some dark matter subhalo associated with the accreted galaxy, 
which will remain until the substructure merges with the cen- 
tral galaxy via dynamical friction or (much more rarely) an- 
other satellite substructure. Therefore, knowing the subhalo 
populations of all halos at a given instant, the calculation 
of the rate and distribution of galaxy mergers depends only 
on calculating the efficiency of the subhalo/galaxy mergers 
within these halos. This is a great advantage - we do not 
need to calculate halo-halo merger rates, which are not well- 
defined (even when extracted directly from cosmological sim- 
ulations) and_d^2ejid_sensn^efy definitions 
(see, e.g. lGotflober et al.l2 001; Mal leret al.l2006l) . but instead 
work fro m the robust (an d well-defined) subhalo mass func- 
tion (see lGao et al.l l2004. and references therein). 

This is similar to many of the most recent semi-analytic 
models, which adopt a hybrid approach to determine galaxy 
mergers, in which galaxies survive independently so long as 
their host halo remains a distinct substructure, after which 
point a dynamical friction "clock" is started and the galaxy 
merges with the central galaxy in its parent halo at the end of 
the dynamical friction time. Fortunately, for our purposes we 
are only interested in major mergers with mass ratios < 3 : 1. 
In these cases, dynamical friction acts quickly on the subhalos 
(infall time < fe/3 at all redshifts), and the primary ambigu- 
ity will be the galaxy merger time in their merged or merging 
subhalos. 

To perform this calculation, we need to know the proper- 
ties of the merging galaxies. For now, we only want to cal- 
culate where and when galaxies are merging, not how they 
evolved to their present state in the first place. This is our pri- 
mary reason for not constructing a full semi-analytic model: 
rather than introduce a large number of uncertainties, theo- 
retical prescriptions which we are not attempting to test here, 
and tunable parameters in order to predict that e.g. a 10 u Mq 
halo typically hosts a ~ 10 10 M Q star-forming galaxy, we can 
adopt the established empirical fact that this is so. In de- 
tail, we populate subh alos according to an empirical halo oc- 
cupation model (e.g., iTin ker et al. 2005; Conroy et al .] 120061: 
Vale & Ostriker 20061 Ivan den Bosch et aTll2006t IWang et all 
2006); i.e. matching the observed statistics of where galax- 
ies of a given type live (accounting for different occupations 
for different galaxy types/colors, and the scatter in galaxies 
hosted in halos of a given mass). 

This is sufficient for most of our predictions. We do not 
necessarily need to know exactly how long it will take for 
these mergers to occur, only that they are occurring at a given 
redshift - i.e. that the objects will merge and that the merger 
time is shorter than the Hubble time (which for the mass ratios 
of interest is essentially guaranteed). For example, predicting 
the clustering of galaxy mergers does not require knowledge 
of how rapidly they occur, only where they occur. Even pre- 
dicting the observed merger mass function does not rely sen- 
sitively on this information, since the duration over which the 
merger is visible will be comparable (albeit not exactly equal) 
to the duration over which the merger occurs (such that a fixed 
fraction ~ 1 of all merging systems are observable). 

However, for the cases where it is necessary, we estimate 
the timescales for the galaxies to merge and to be identi- 
fied as mergers. This is the most uncertain element in our 
model. Part of this uncertainty owes to the large parameter 
space of mergers (e.g. differences in orbital parameters, rela- 
tive inclinations, etc.). These uncertainties are fundamental, 



but can at least be controlled by comparison to large suites 
of hydr odynamic simulations which sample these parameter 
spaces (Robertson et al. 2006b) and allow us to quantify the 
expected range of merger properties owing to these (essen- 
tially random) differences. The more difficult question is how 
appropriate any analytic merger timescale or cross section can 
be. To address this, we will throughout this paper consider a 
few representative models: 

Dynamical Friction: The simplest approximation is that the 
galaxies are point masses, and (once their subhalos merge) 
they fall together on the dynamical friction timescale. This is 
what is adopted in most semi-analytic models. In fact, this is 
only an appropriate description when the galaxies are small 
relative to the enclosed halo mass, and are both moving to the 
center of the potential well - which is often not the case at 
these late stages. While unlikely to be incorrect by orders of 
magnitude, this approximation begins to break down when the 
galaxies are relatively large compared to their halos (common 
in < 10 12 Mq halos) and when the galaxies are very close (and 
could e.g. enter a stable orbit). What finally causes galaxies 
to merge is not, in fact, simple dynamical friction, but dis- 
sipation of angular momentum via a resonance between the 
internal and orbital frequencies. 

Group Capture (Collisional): On small scales, in satellite- 
satellite mergers, or in the merger of two small field halos, it is 
more appropriate to consider galaxy mergers as a collisional 
process in which there is some effective gravitational cross 
section. In other words, galaxy mergers proceed once the 
galaxies pass at sufficiently small distances with sufficiently 
low relative velocity. There have been a number of theoreti- 
cal estimates of th ese cross sections - we adopt he re the fit- 
ting formulae from lKrivitskv & Kontorovichld 19971) . who cal- 
ibrate the appropriate cross-sections from a set of numerical 
simulations of different encounters and group environments . 
This compares well with other c alculations (White 119761 ; 
Maki no~& Hudfl997l : lMamonll2006l and references therein), 
and we find little difference using these alternative estima- 
tions. For large mass ratios and separations, the expressions 
appropriately reduce to the dynamical friction case. 

Angular Momentum: Binney & Tre mainel (119871) consider 
this problem from the perspective of the angular momentum- 
space in which galaxy mergers are allowed. This approach 
is similar to the capture estimates above, but accounting for 
capture into orbits as well. Whether or not such orbits will 
merge is, of course, somewhat ambiguous - it is likely that 
some significant fraction are stable, and will not merge, while 
others decay rapidly owing to resonance between the disk cir- 
cular frequencies and the orbital frequency. Nevertheless, this 
serves to bracket the range of likely merger configurations. 

2.1.2. Synopsis of model and uncertainties 

Thus, to summarize our approach: at a given red- 
shift, we calculate the halo ma ss function n(Afha io) for 
our adopted cosmology following ISheth et al.l ((2001). For 
each halo, we calculate the (weakly mass and redshift de- 
pendent) subhalo mass function (or distributi on of sub- 
halos PrN subhaiolMgubhaioi ^haiol ) following IZentner et all 
(120051) and iKravtsov etall (120041). A l ternati ve ly, we have 
adopte d it directly from iGao etalJ (120041): rNurmietalJ 
(2006) or calculated it following Ivan den Bosch et alj (12005): 
IVale & Ostrikerl (120061) . and obtain similar results. Note that 
the subhalo masses are defined as the masses upon accretion 
by the parent halo, which makes them a good proxy for the 
hosted galaxy mass (IConrov et al.l l2006) and removes the un- 
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certainties owing to tidal mass stripping. 

Mergers are identified by the basic criteria described above. 
We populate these halos and subhalos with ga laxies follow- 
ing th e empirica l halo occupation mo dels of IConrov et al.l 
(2006) (see also Vale & Ostriker 2006) normalized directly 
with group observations following Wan g et al.l (120061) at z = 
(considering instead the occupation fits in lYang et al.l 120031; 



Cooravl 120051 l2006t IZheng et al.l [20051; Ivanden Bosc h et all 
20061 makes little difference). This determines both the mean 



stellar mass and dispersion in stellar masses of galaxies hosted 
by a given halo/subhalo mass P(M ea i | M su bhaio), which (op- 
tionally) can be broken down separately for blue and red 
galaxy types. 

Figure [3] shows the mean galaxy mass as a function of 
halo mass from this model at z = 0. Since the halo occupa- 
tion models consider stellar mass or luminosity, we use the 
baryonic and stellar m ass Tully-Fisher relations calibrated by 
iBell & de Jongl (12001 1) to convert between the two. (We have 
also compared the global baryonic mass function estimated 
in this m anner with that observationally inferred in Be ll et all 
(2003a) and find good agreement). If necessary, we calcu- 
late the galaxy-galaxy merger efficiency/timescale using the 
different estimators described above. Figure [3] also shows 
the expected merger efficiency as a function of halo mass 
for these mean values (i.e. probability of hosting a subhalo 
within the appropriate mass range convolved with the calcu- 
lated merger timescale). The qualitative features are as ex- 
pected from Figure |2] The different merger timescale estima- 
tors agree well at large halo masses, with the dynamical fric- 
tion treatment yielding a somewhat longer (factor < a few) 
timescale at intermediate masses (but this is near the regime 
of low Mh a io/Mg a i where the dynamical friction approxima- 
tion is least accurate). 

The main elements and their uncertainties in our model are: 

1. Halo Mass Function: We begin by computing the over- 
all halo mass function. There is very little ambiguity in this 
calc ulation at all red shifts and masses of interest (z < 6; see 
e.g. lReed et aHl2007l) . and we do not consider it a significant 
source of uncertainty. 

2. Subhalo Mass Function: The subhalo mass function 
of each halo is then calculated. Although numerical simula- 
tions and semi-analytic calculations generally give very simi- 
lar results (especially for the major-merger mass ratios of in- 
terest in th is paper, as opposed to ve ry small subhalo popula- 
tions; see lvan den Bosch et al.ll2005l) . there is still some (typi- 
cal factor < 2) disagreement between different estimates. We 
therefore repeat most of our calculations adopti ng both our 
"default" subhalo mass fu nction calculation (IZentner et al.l 
120051: iKravtsov et al . 2004) a nd an alternative s ubhalo mass 
function calculation (Ivan den Bosch et a l. 2005) (normalize d 
to match cosmological simulations as in IShaw et ail 2006), 
which bracket the range of a number of different estimates 
(e.g. ISpringel et alj200UlTormen et al.ll2004l;lDe Lucia et al l 
l2004l:lGao et al .120041: iNurmi et al.l2006l) and demonstrate the 
uncertainty owing to this choice. The difference is ultimately 
negligible at M^i > 10 I0 M Q at all redshifts, and rises to only 
a factor ~ 2 at M„ a i < 10 10 M© (probably owing to differences 
in the numerical resolution of different estimates at low halo 
masses). 

3. Halo Occupation Model: We then populate the cen- 
tral galaxies and "major" subhalos with an empirical halo 
occupation model. Although such models are constrained, 
by definition, to reproduce the mean properties of the ha- 
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FIG. 3. — Illustration of basic elements of importance to where galaxy- 
galaxy mergers occur. Top: Average central galaxy stellar (dotted) and bary- 
onic (solid) mass a s a function of host halo mass, in our t ypically adopted halo 
occupation model iConroy et al. 2006; Vale & Ostriker 2006, black), and the 
alternate halo occupation model from Yang et al. 1 2003, green; only baryonic 
mass shown) Middle: Corresponding halo-to-galaxy mass ratio. Bottom: Av- 
erage major merger timescale/efficiency (calculated as in the middle panel of 
Figure [2] but for the appropriate mean Mg^iMh^)). Timescales are deter- 
mined as described in the text, from dynamical friction (dot-dashed), group 
capture (solid), or angular momentum (long dashed) considerations. 

los occupied by galaxies of a given mass/luminosity, there 
are known degeneracies between parameterizations that give 
rise to (typical factor ~ 2) differences between models. We 
therefo re again repeat all o ur calculati ons for our "default" 
model (IConrov et alj I2006I) (see also IVale & Ostrikerll2006l) 
and an alternate halo occupation model (|Yang et al.l2003l) (see 
also lYan et al l l2003t IZheng et all I2005I). which bracket the 
range of a number of calculations (e.g., ICoorayll2005L l2006t 
Zhen g et al.ll200l Ivan den Bosch et alJl2006l) . Ag ain, we find 
this yields negligible differences at M g . d \ > 10 M (as the 
clustering and abundances of massive galaxies are reasonably 
well-constrained, and most of these galaxies are central halo 
galaxies), and even at low masses the typical discrepancy rises 
to only ~ 0.2 dex. 
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We note that we have also considered a variety of pre- 
scriptions for the redshift evolution of the halo occupation 
model: including that directly prescribed by the quoted 
models, a complet e re- derivation of the HOD models of 
IConroy et al.1 (120061) and IVale & Ostrikeri (|200 ^) at different 
redshi ft s from the ob s erved m ass function s of iFontana et al.l 
(2006); Bundyetd] d2005l) : iBorch et all d2006t) : iBlantonl 
(2006) (see § 13.11 ). or simply assuming no evolution (in terms 
of galaxy mass distributions at fixed halo mass; for either all 
galaxies or star-forming galaxies). We find that the resulting 
differences are small (at least at z < 3), comparable to those 
inherent in the choice of halo occupation model. This is not 
surprising, as a number of recent studies suggest that there is 
very little evolution in halo occupatio n parameters (in terms of 
mass, or relative to L*) wit h redshift dYan et alJl2003tlCooravl 
120051: IConroy et al.l 120061) . or equivalently that the masses 
of galaxies hosted in a halo of a given mas s are primarily 
a function of that halo m ass, not of redshift dHevmans et al.l 
l2006t IConroy et al]|2007l) . This appears to be especially true 
for star-forming and ~ L* galaxies (of greatest importance 
for our conclusions; IConroy et al.| [2007). unsurprising given 
that "quenching" is not strongly operating in those systems to 
change their mass-to-light ratios. 

4. Merger Timescale: Having populated a given halo and 
its subhalos with galaxies, we then calculate the timescale for 
mergers between major galaxy pairs. This is ultimately the 
largest source of uncertainty in our calculations, at all red- 
shifts and masses. Again, we emphasize that some of our 
calculations are completely independent of these timescales. 
However, where adopted, we illustrate this uncertainty by pre- 
senting all of our predictions for three estimates of the merger 
timescale: a simple dynamical friction formula, a group cap- 
ture or collisional cross section estimate, and an angular mo- 
mentum (orbital cross section) capture estimate, all as de- 
scribed above. At large masses and redshifts z < 2.5, this is 
a surprisingly weak source of uncertainty, but the estimated 
merger rates/timescales can be very different at low masses 
M ga \ < 10 10 M Q and the highest redshifts z ~ 3 - 6. 

At low masses, this owes to a variety of effects, including 
the substantial difference between infall or merger timescales 
and the timescale for morphological disturbances to be ex- 
cited (different in e.g. an impact approximation as opposed to 
the circular orbit decay assumed by dynamical friction). 

The difference in redshift evolution is easily understood: 
at fixed mass ratio, the dynamical friction timescale scales 
as ?df oc ?h oc p~ 1//2 , but a "capture" timescale will scale with 
fixed cross section as t oc l/(n (erv)) oc so that (while the 
details of the cross-sections and dependence of halo concen- 
tration on redshift make the difference not quite as extreme 
as this simple scaling) the very large densities at high redshift 
make collisional merging increase rapidly in efficiency. The 
true solution is probably some effective combination of these 
two estimates, and the "more appropriate" approximation de- 
pends largely on the initial orbital parameters of the subhalos. 
At present, we therefore must recognize this as an inherent 
uncertainty, but one that serves to bracket the likely range of 
possibilities at high redshifts. 

2.2. Where Mergers Occur 

We are now in a position to predict the statistics of mergers. 
First, we illustrate some important qualitative features. Fig- 
ure H] shows the merger efficiency (as in Figure [2j for differ- 
ent classes of mergers: major mergers with the central galaxy 
in a halo, minor mergers with the central galaxy, and major 



mergers of two satellite galaxies in the halo. We show the re- 
sults for our "default" model, adopting the dynamical friction 
merger timescale, but the qualitative results are independent 
of these choices. The key features are expected: major merg- 
ers are efficient at small group scales (halo masses) compa- 
rable to or just larger than the average host halo mass for a 
given Mgai. At larger Mh a i , major mergers become more rare 
for the reasons in § 12.11 However, although dynamical fric- 
tion times increase, the rapidly increasing number of satellite 
systems in massive halos means that minor merger accretion 
onto the central galaxy proceeds with a relatively constant ef- 
ficiency. This will not trigger substantial quasar or starburst 
activity or morphological transformation, but may be impor- 
tant for overall mass growth in l arge cD galaxies, although 
recent cosmological simulations (Mailer et al. 2006) suggest 
that major mergers dominate minor mergers in the assembly 
of massive galaxies (although their simulation does not extend 
to the largest cD galaxies). 

Satellite-satellite minor mergers are a small effect at all 
masses, as expected (by the time a halo is sufficiently massive 
to host a large number of satellites of a given M ga i, the orbital 
velocity of the galaxies about the halo is much larger than 
their individual internal velocities). In what follows, we will 
generally ignore satellite-satellite mergers. Including them 
is a very small correction (generally <C 10%), and their dy- 
namics are uncertain. Moreover, their colors and star forma- 
tion histories are probably affected by processes such as tidal 
stripping, harassment, and ram-pressure stripping, which we 
are neither attempting to model nor test. We have however 
checked that there are no significant or qualitative changes to 
our predictions if we (naively) include the satellite-satellite 
term. 

Although the consequences of the merger will be very dif- 
ferent, the efficiency with which two galaxies merge does 
not depend strongly on whether they are star-forming or 
red/passive (all else being equal). It is therefore a conse- 
quence that, at low redshifts, gas-rich mergers are generally 
relegated to low stellar masses and field environments where 
such galaxies are common. Figure [5] illustrates this. We plot 
the mean efficiency of major, central galaxy mergers (as in 
Figure @] but for the mean M ga i at each Mhaio) as a function 
of halo mass at each of three redshifts. At each redshift, we 
divide this into the observed fraction of red and blue galax- 
ies at the given galaxy/halo mass, using the appropriate ob- 
served, type-separated galaxy mass functions. The efficiency 
of mergers at a given halo and galaxy mass does not evolve 
(note that this is not a statement that the overall merger rates 
will not change, but rather a statement that the same galaxies 
in the same halos will merge at the same rate). However, at 
low redshifts, red galaxies dominate the mass budget, whereas 
at high redshifts, most galaxies are still blue (star-forming) in 
all but the most massive halos. We will discuss the possi- 
bility that mergers themselves drive this change in the blue 
and red fractions in Paper II, but for now illustrate that the 
locations of gas-rich and dry mergers reflect where gas-rich 
and gas-poor galaxies dominate the population, respectively, 
which is empirically determined at the redshifts of interest 
here. We note that our halo occupation models do not ex- 
plicitly model a dependence of halo populations on central 
galaxy properties; i.e. the tentative observational suggestion 
that, at fixed halo and galaxy mass, red central galaxies are 
prefe rentially surrounded b y red (as opposed to blue) satel- 
lites dWeinmann et al.ll2.006l) . If real, the effect of such a trend 
is to make the transition plotted in FigureHsomewhat sharper 
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FIG. 4. — Merger efficiency (arbitrary units; denned in the same manner 
as the lower panel of Figure [5] with different linestyles in the same style for 
various mass galaxies) for different classes of mergers. Using the subhalo 
mass functions and halo occupation models, we can separate major mergers 
onto the central galaxy in a halo (top), minor (mass ratio > 3 : 1 but < 10 : 
1) mergers onto the central galaxy (middle), and satellite-satellite mergers 
(bottom). Major mergers occur efficiently in central galaxies near the small 
group scale for each M ga i. When galaxies live in very massive halos, they 
experience a large number of minor mergers from the satellite population. 
Satellite-satellite mergers are a relatively small effect at all galaxy and halo 



- this has little effect on our conclusions, but does somewhat 
lower the predicted gas-rich merger rates (and corresponding 
predicted quasar luminosity density) at z < 0.5 (since a red 
central galaxy would have a lower probability of an infalling, 
gas-rich system). 

Integrating over the appropriate galaxy populations, Fig- 
ure [6] compares the predicted z = merger fraction as a func- 
tion of halo mass from this model with that observed. The 
agreement is good over a wide dynamic range. Although 
there is a significant (factor ~ 2) systematic difference based 
on how this fraction is calculated, this is within the range of 
present observational uncertainty. It is also important to dis- 
tinguish the merger fraction of parent halos (i.e. fraction of 
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FIG. 5. — Merger efficiency (arbitrary units; calculated as in Figure [2) 
as a function of halo mass (adopting the mean M Ba i(Mi i a i ) from F igure [3}. 
Using the type-se p arated galaxy mass functions from Bell et al. 12003b!): 
Borch et al. (2006); Fontana et al. (2004) at z = 0, 1, 2, respectively, we show 
the fraction of galaxies at each mass expected to be gas-rich and gas-poor, 
at each of three redshifts. At high redshifts, all but the most massive merg- 
ing galaxies will be gas-rich, whereas at low masses the gas-poor population 
dominates at most masses where mergers are efficient. 



groups which contain a merger) and that of galaxies (i.e. frac- 
tion of all galaxies at a given M ga ] orMh a i which are merging), 
as at large halo masses the rate of mergers onto the central 
galaxy could remain constant (giving a constant merger rate 
per halo), but the inefficient merging of the increasingly large 
number of satellites will cause the galaxy merger fraction to 
fall rapidly. 

We also show the distribution of mergers (interacting pairs) 
and all galaxies in environmental density (local projected sur- 
face density £5 = 5/(7rc/|), where is the distance to the 
fifth nearest-neighbor) from the local group catalogues of 
ISol Alonso et al.l (120061) - we compare this data set directly 
to our predicti on by converting E <; to Mhain using th e mean 
relation from ICroton et a l. (2006), as in Bald rvetalJ (|2006) 
(although as they note, the relation has considerable scatter). 
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Similarly, we show the post-starburst (gen erally merger rem- 
nant) fraction from lHogg et al.l (120061) and lGotol (l2005h . as a 
function of surface density on large scales. 

Our predictions and the observations emphasize that galaxy 
mergers occur on all scales (in halos of all masses), and in all 
environments. In a global sense, there is no preferred merger 
scale. That is not to say that mergers of galaxies of a particular 
mass do not have a preferred scale (indeed, in our modeling, 
this is explicitly the small group scale), but rather because 
this scale is a function of galaxy mass, mergers of some mass 
occur in all halo masses and environments. It is clear that it 
is a mistake to think that mergers would not occur in field (or 
even void) environments, a fact which is very important to the 
formation of spheroids and quasars in these locations. 

2.3. How Mergers Are Influenced By Environment 

Figure [6] demonstrates that, all else being equal, mergers 
do not depend on the large scale environment. This is con- 
ventional wisdom, of course, because mergers are an essen- 
tially local process. However, there is one sense in which 
the merger rate should depend on environment. If the local 
density of galaxies (supply of systems for major mergers) is 
enhanced by some factor 1 + 6, then the probability (or rate) 
of major mergers should be enhanced by the same factor. 

In detail, our adopted model for the merger/capture cross 
section of galaxies (§ 12. U allows us to calculate the differen- 
tial probability that some halo/subhalo or galaxy population at 
a given distance r will merge with the central galaxy in a time 
< ?h- Given the observed gala xy-galaxy corr elation function 
as a function of stellar mass dLietal] 12006a). we can triv- 
ially calculate the mean number density of galaxies (possible 
fuel for major mergers) in a shell dr at r, and combining this 
with the merger rate/cross section calculation determines the 
differential contribution to the total merger rate of galaxies of 
that mass, from pairs at the separation dr. This can be thought 
of as either a capture process from halo/subhalo orbits, or a 
global inflow rate from dynamical friction and gravitational 
motions; the results are the same, modulo the absolute merger 
rate n ormalization dBinnev & Tremainelll987t [Masiedi et al. 
2006). Next, assume that the density of these companions 
is multiplied, at this radius, by a factor 1 + S r (relative to the 
mean ((1 + 6 r )) expected at that r for the given central halo 
mass). Integrating over all radii, we obtain the total merger 
rate/probability, with the appropriate enhancement. 

Figure [7] illustrates this, calculated in several radial shells 
using our gravitational capture cross sections to estimate the 
enhancement (the other cross sections yield similar results). 
The absolute value of the probability shown will be a function 
of galaxy mass, halo mass, and redshift, but the qualitative 
behavior is similar. Unsurprisingly, density enhancements on 
small scales (r < lOOkpc, where most systems will merge) 
linearly increase the merger rate accordingly. Note that den- 
sity decrements decrease the merger rate only to a point - 
this is because even for a galaxy with no companions within 
a lOOkpc radius, there is of course some non-zero probabil- 
ity that companions will be accreted or captured from initially 
larger radii and merge in t -C fy. 

At larger radii, the enhancement is less pronounced. A 
galaxy in the center of a halo of a given mass in a ~ 3 Mpc 
overdensity is not substantially more likely to experience a 
major merger, because there is little contribution to its merger 
rate from those large radii (at least on short timescales; of 
course, over t ~ tu subhalos may be accreted from these radii, 
but by then the density structure will change and the merger 



rate will reflect that). Naturally, an overdensity at the ~ 3 Mpc 
scale implies an enhanced density within that scale. However, 
we are considering this for galaxies and halos of a specific 
mass, for which the virial radii are generally much smaller 
than these scales, so the increased density in this annulus does 
not necessarily imply an enhanced galaxy density within the 
halos themselves (for that Mhaio)> although it may affect the 
overall abundance of the halos. As a general rule, merger 
rates will scale with environmental density on scales less than 
the virial radii of the masses of interest, and be independent 
of density on larger scales. 

If the merger rate increases in regions with small-scale over- 
densities, then mergers themselves should be biased to such 
regions. To the extent that the small-scale galaxy overdensity 
around a merger traces this overdensity (which we caution is 
not necessarily true, as one of the initial galaxies in this over- 
density is, by definition, consumed in the merger), this implies 
that mergers and merger remnants should preferentially ex- 
hibit small-scale density excesses. The magnitude of this ex- 
cess is straightforward to determine: for a given galaxy/halo 
mass, the distribution of environments (densities ( 1 + 6 r ) on a 
given scale r) is known. Then, for each scale r, the calculation 
in Figure|7]gives the relative probability of a merger as a func- 
tion of overdensity. Convolving the probability of any object 
being in given overdensity with the probability of a merger in 
that overdensity gives the mean overdensity of mergers at that 
scale, i.e. 

(-^merger) _ t /^^merger(^)^ > (-^ | ^^halo) dx 

(x a ii) JxP(x\M halo )dx 

where x = (1 +6 r ). 

It is straightforward in extended Press-Schechter theory to 
calculate of the probability of forming a halo of a given mass 
in a given overdensity on a particular scale dMo & White! 
1996). However, since we are calculating a galaxy overden- 
sity in radii about the merger candidate, Poisson noise is dom- 
inant on small scales where the average number of compan- 
ions is < 1 - nevertheless it is again straightforward to cal- 
culate the probability of a given overdensity. In any case 
we account for both effects, and show the results in Fig- 
ure [8] Specifically, we show the average number of com- 
panions within a radius of a given r about a merger, for all 
field galaxies. We then multiply the field curve by the cal- 
culated overdensity of mergers as a function of r. The exer- 
cise can then be trivially repeated for the correlation function 
£(r). We compare with observed post-starburst populations 
(E+A/K+A) galaxies, and find that they display a similar ex- 
cess on small scales. As before, the difference on large scales 
is negligible - unsurprisingly, the density excess becomes im- 
portant at r < r v ; r for the typical galaxies of interest. 

Finally, we stress that the excess of companions on small 
scales does not, in this model, stem from those galaxies them- 
selves having any interaction with the central merger (rem- 
nant), but reflects a genuine small-scale overdensity (as in 
small groups), in which mergers will be more likely. 

2.4. Integrated Merger Populations Over Time 

At a given redshift, we use our model to predict the mass 
function of mergers. For clarity, we take the mass of a merger 
to be the total stellar mass of the remnant galaxy (roughly the 
total baryonic mass of the merger progenitors). This avoids 
ambiguity in merger mass ratios, tends to be observationally 
representative (since mergers are generally labeled by total 
luminosity/stellar mass), and has been shown in simulations 
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FIG. 6. — Top: Merger fraction as a function of host halo mass. The fraction of all halos (groups) predicted to host at least one major merger of galaxy mass 
> 10 10 M© is plotted (left), as is the fraction of all galaxies in halos of a given Mh a i which are merging (right). We show the predictions for several variations 
of our standard model (described in the text) used to identify all merging systems (black lines, as labeled), and adding a more detailed calculation of the actual 
timescale for the physical g alaxy mergers (bl ue line s, as labeled) and ability to morphologically identify them. Both are compared with observed merger fractions 
(points) from Sol Alonso et al. (purple circles 2006) (we convert their measured intermediate-scale densities to average halo masses following Baldrv et al. 2006; 
Kauffmann et al. 2004, shown as open and filled points, respectively). Bottom: The observed distributions (fraction of objects per logarithmic interval in galaxy 
surface density) of merger and normal galaxy environments, from the group catalogues of Sol Alonso et al. 1 2006) (left), and the fraction of recent merger remnant 
(post-starburst, K+A) galaxies as a function of galaxy surface density averaged on intermediate (1.5Mpc) and large (8 Mpc) scales (right). Mergers occur on all 
scales and in halos of all masses, without a strong feature at a particular scale. 

functions and merger fractions estimated at all z < 1.5, sug- 
gesting that our model does indeed reasonably describe the 
true nature of galaxy mergers. For comparison, we show the 
results obtained using a different halo occupation model to as- 
sociate galaxies and halos, or using a different set of simula- 
tions/models to estimate the subhalo mass functions. As noted 
in § 12.11 these choices make very little difference (consider- 
ably smaller than e.g. the systematics in the observations). 

It is not always clear, however, that observations capture 
all merging pairs (or that our definition of "all" is appropriate 
as, for some mergers, f me rger — > ?h)- Often, systems are iden- 
tified as mergers on the basis of tidal disturbances and other 
clear morphological signatures. We therefore calculate the 
mass function of systems observed in this ma nner. This re- 
quires that we adopt one of the models in § 12. 1 I for the merger 
timescale, which tells us how long it will characteristically 
take for a given merger to reach the interaction cross section 
where tidal disturbances will be excited. Then, using numer- 
ical simulations to estimate the typical duration of those fea- 
tures (in which they will be id entified by typi cal morphologi- 
cal classification schemes, see lLotz et alJ2007h . we obtain the 
observed "disturbed morphology" mass functions. We per- 
form this calculation using each of the methods for calculating 
the merger timescale described in § 12. 11 Note that the number 
of systems according to this convention can exceed that in our 
"all pairs" definition if the timescale on which disturbances 
are visible is longer than the "infall" timescale or timescale 
on which the subhalo survives (the case for very efficient in- 
f all/capture). 

At high masses, the difference between samples of merg- 
ing pairs and those of disturbed systems is small, as is the 
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FIG. 7. — Dependence of the merger rate/probability on environmental 
density decrement/enhancement within a given radius r; i.e. galaxy overden- 
sity (1 +<5 r )/((l + S r )) at a fixed galaxy and host halo mass (absolute units 
are arbitrary here, and depend on these quantities). On scales less than the 
typical virial radii of interest, the merger rate increases with overdensity (lin- 
early at 8 r ^> 1), but it is independent (for a fixed halo mass) of large-scale 
environment. 

to be a better proxy for the merger behavior than the initial 
mass of either progen itor (as long as it is still a major merger; 
Hop kins et alJl2006ah . 

Figure [9] shows the mass functions of ongoing mergers at 
each of several redshifts. We first consider the mass func- 
tion of "all" objects which will merge efficiently - i.e. the 
mass function of merging pairs. This requires no knowl- 
edge of the actual timescale of the merger or e.g. lifetime 
of tidal disturbances. The results agree well with the mass 
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FIG. 8. — Excess galaxy overdensity on small scales predicted for mergers 
from our model. Because mergers are more likely when there is a galaxy 
overdensity on small scales (Figure[7), mergers will, on average, occur in re- 
gions with slightly enhanced small-scale densities. We show the real-space 
correlation function (bottom; technically the merger-galaxy cross correlation 
function) and correspondi ng number of companions within a given radius 
(top) of all field galaxies ( Goto 2005), and then this multiplied by the pre- 
dicted excess on small scales for mergers (essentially integrating over the 
probability bias to large overdensity on small scales in Figure [7). Dashed 
blue lines indicate the errors in our estimate from the combination of uncer- 
tainties in the field galaxy correlation function, the range of galaxy masses 
considered (which slightly shifts the physical scale on which the effect is im- 
portant), and the inclusion/exclusion of Poisson noise in the distribution of 
overdensities for a given halo mass. The observed number of companions 
and clustering of po st-starburst ( likely merger re mnant) galaxies is shown 
for comparison, from lGotol j2005, red circles) and H ogg et al i 1 2006, purple 
diamonds). 

difference between our choice of methodology in calculating 
the merger abundances and/or timescales. This is because 
high-mass systems merge more quickly, excite morphologi- 
cal disturbances more easily on first passage, and are brighter 
(making faint morphological features easier to identify). At 
very low masses M ga \ < 10 10 M Q , our predictions do diverge 
- this is because the overall infall or merger timescale can 
become substantially longer than the timescale over which 
morphological disturbances are excited (in these cases, this 
occurs closer to the final coalescence). Although this con- 
clusion merits more detailed numerical investigation in future 
work, it has little effect on any of our predictions - for ex- 
ample, the total merger fraction (especially at high redshift) is 
restricted to larger-mass M ga i > M* systems, where the predic- 
tions agree well, and the overall merger mass density is nearly 




log( M aal /M s ) 

FIG. 9. — Mass functions (in terms of the remnant stellar mass) of ongoing 
mergers at each of several redshifts (labeled). Observ ed mass functions (s olid 
red points) are shown from Xu et al. ( 2004, stars) and Bundv et al. 1 2005, cir- 
cles) (for a detailed analysis of the mass functions, see Hopkins et alJ2007J) . 
EiTor bars do not include cosmic variance. Observed merger fractions (open 
orange points), converted to a mass function estimate o v er the mass range 
sampled ( horizon tal errors) are shown from Bell et al. 1 200(J cross) and 
Lotz et al. (2006b, squares), with errors including cosmic variance. We com- 
pare the prediction of our default model (thick solid black line), for the abun- 
dance of mergers and merging pairs. Dotted line employs a different halo 
occupation model, and dashe d line a dopts a different fit to the subhalo mass 
functions (see Figure [6]and jj |2.1.2t . We also show the predictions for mor- 
phologically identified mergers (thin blue lines), which requires estimating 
the merger ti mesc ale/capture efficiency and duration of morphological distur- 
bances (see S I2.lt . We estimate these using a group capture/collisional model 
(solid), angular momentum capture cross-sections (long dashed), and simple 
dynamical friction considerations (dotted), calibrating the duration of distur- 
bances from numerical simulations ( Lotz et al. 2007). At masses 
there is little difference owing to methodology. At very low masses, simula- 
tions suggest that the merger timescale (i.e. orbital or crossing time after first 
passage) is considerably longer than the time period over which strong dis- 
turbances are excited; however, this is below the mass scales of interest for 
most of our predictions. 

identical regardless of the methodology. Furthermore, quasar 
and galaxy formation processes are probably influenced (or 
even dominated) by other mechanisms (such as secular disk 
instabilities and quenching via infall as a satellite galaxy) at 
these low masses, which we do not attempt to model. 

We next integrate the mass functions in Figure [9] above a 
given mass limit to predict the merger fraction as a function 
of redshift, shown in Figure [10] The fraction i s deter mined 
relative to the mass functions in Font ana et al.l d2006l) . who 
provide a continuous fit over the range of interest. But we note 
that since this is an integrated quantity, the difference ado pt- 
ing other mass function estimates (e.g. iBorch et"a l. 2006) is 
small (at least at z < 1 .5). Comparing this to a range of obser- 
vations, the agreement is good, especially for the deeper mass 
limit. For high mass mergers (M ga i > 1 1 1 M© ) there is greater 
scatter in the observations, which most likely owes to cosmic 
variance (especially at z < 0.2). In both cases, however, the 
merger fraction is not an especially steep function of redshift. 
In fact, between z = 0.3- 1.5, the fraction increases by only 
a factor ~ 3 — 4, consistent with most observatio ns finding 
a relatively flat merger fraction in this range (e.g. iLin et al.l 
l2004t iL otz et al. 2006b) and recent cosmological simulations 
(Mail er et al.ll2006l) . Further, although halos may be merging 
more frequently at high redshift, they are also merging more 
rapidly, meaning that the fraction merging at any instant can 
be relatively flat. 

Finally, given our model for the halos hosting mergers, it is 
straightforward to calculate the predicted clustering properties 
of those mergers. Specifically, we have already predicted a 
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FIG. 10. — P redicted merger fra ction as a function of red shift (lines, same style as Figure[9}, above two approxim ate mass limits. Observat i ons (p oints) 
are shown from Patton et al. (2002, filled inverted triangles), Conselice et al. 12003, filled circles), Bundv et al. 12004, filled triangles), Lin et al. (2004, open 
diamo nds), IXu et al . (2004, open s tars). | D e Propris et al. (20051 °P en circl es), Cassata et al. (2005, fill ed diam onds), Wolf et al. (2005, fille d stars). IBundv et all 
12005, open triangles), Lotz et al. (2006a, open inverted triangles), Lotz et al. (2006b, open squares), Bell et al. (2006, filled squares), and Bridge et al. (2007, 
X 's). Note that the mass limit is only approximate in several of these cases, as they are selected by optical luminosity. The predicted merger fractions agree well, 
especially for the deeper case which resolves M t galaxies. 

number density of mergers as a function of halo mass, galaxy 
mass, and redshift; i.e. some « me rgei-(A^gai l^haio, z). Knowing 
the clustering amplitude or bias of each host halo b(Mh a \ \ z), 
it is straightforward to predict the clustering of the merging 
galaxies, in the same manner by which halo occupation mod- 
els construct the clustering of a given population: 



/ MMh a lo)«mei-g e r(M ga i | M ha lo) dM hak) 
/ "mei-ger(M ga l | M halo ) dM halo 



(3) 



We calc ulate b{M ba [ ) follo wing iMo & White! (1 19961) as up- 
dated by ISheth et all (1200 lh to agree with the results of nu- 
merical simulations. 

Figure QT| shows this as a function of redshift. Since ob- 
servations generally sample near M ga i ~ M*, we plot this 
for M ga ] = M*(z = 0) w 10 n M Q . We compare with avail- 
able clustering measurem ents for like l y majo r-merger popu- 
lations. At low redshifts, iBlake et all J2004) have measured 
the clustering of a large, uniformly sele cted sample of post- 
starburst (E+A/K+A) galaxies in the 2dF. l!nfante et al.l(l2002l) 
have also measured the large-scale clustering of close galaxy 
pairs selected from the SP SS at low redshift. At high redshift, 
no such samples exist, but Blain et al. (2004) have estimated 
the clustering of a moderately large sample of spectroscopi- 
cally identified sub-millimeter galaxies at z ~ 2-3, which as 
discussed in §Q]are believed to originate in major mergers. 
Our prediction is consistent with these constraints - however, 
given the very limited nature of the data and the lack of a uni- 
form selection criteria for ongoing or recent mergers at differ- 
ent redshifts, we cannot draw any strong conclusions. 

One caution should be added: recent higher-resolution sim- 
ulations suggest that the approximation here (and in many - 
but not all - halo occupation models), that bias is a function 
only of halo mass at a given redshift, may not be accurate 
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FIG. 1 1. — Comparing our predicted clustering of ~ M t major mergers 
(lines; style as in Figure|9) as a function of redshift to that various populations 
usually associated with galaxy mergers (points): post-starbur st (E+A/K +A) 
galaxies (Blake et al. 2004, star), close galaxy pairs (Infante et al. 2002, dia- 
mond), and sub-millimeter galaxies (Blain et al. 2004, square). 



(e.g JGao & White1l2006l lHarker et alJ 120061 IWechsler et aD 
2006). In particular, because mergers have particularly re- 
cent halo assembly times for their post-merger masses, they 
may represent especially biased regions of the density dis- 
tribution. Unfortunately, it is not clear how to treat this 
in detail, as there remains considerable disagreement in the 
literature as to whether or not a significant "merger bias" 
exists (see, e.g. | Kauffmann & Haeh nelt 2002; Perciv aTet alJ 
12003b iFurlanetto & Kamionkowskill2006l: iLidz et al.l l2007b). 
Furthermore the distinction between galaxy-galaxy and halo- 
halo mergers (with the considerably longer timescale for most 
galaxy mergers) means that it is not even clear whether or not, 
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after the galaxy merger, there would be a significant age bias. 

In any case, most studies su ggest the effect i s quite small : 
using the fitting formulae from Wec hsler et all d200l [2006), 
we find that even in extreme cases (e.g. a Mhaio 3> M* halo 
merging at z = as opposed to an average assembly redshift 
Zf ~ 6) the result is that the standard EPS formalism under- 
estimates the bias by w 30%. For the estimated character- 
istic quasar host halo masses and redshifts of interest here, 
the maximal effect is < 10% at all z = 0-3, much smaller 
than other systematic effects we have considered (and gen- 
erally within the range of our plott ed variant calculation s in 
Figure fTTTl. This is consistent with iGao & W hite ( 2006|) and 
ICroton et al.l (120071) who find that assembly bias is only im- 
portant (beyond the 10% level) for the most extreme halos or 
galaxies in their simulations, where for example the clustering 
of small halos which are destined to be accreted as substruc- 
ture in clusters (> 10 15 h M©) will be very different from the 
clustering of similar-mass halos in field or void environments. 
Indeed, our own calculation in Figure [8] suggests that merger 
bias applies only on small scales, and that mergers show no 
preference for excess densities on the large scales for which 
the linear bias description is meaningful. The effect may grow 
with redshift, however, so care should be taken in extrapolat- 
ing the predictions in FigureQT]to higher redshifts. For further 
discussion of th e effects on the data a nd predictions shown 
here, we refer to lHopkins et al.1 (l2007dl) . 

For the sake of future comparison, we show in Figure[T2lour 
predictions for the merger fractions and clustering of mergers 
(Figure [TOl & fTTl respectively) at all redshifts z = 0-6. We 
note the caveat that our merger fraction is defined relative to 
the mass functions in Fon tana et al.ld2006l) . which become un- 
certain at high redshifts, although this uncertainty is compara- 
ble to the differences between the method s of calculating the 
merger timescale (as discussed in § |2.1.2i . It is also less clear 
what the observable consequences of mergers at the highest 
redshifts may be - if merger rates are sufficiently hig h, there 
may be a large number of multiple mergers (as in iLi et all 
(2006b)), or systems may effectively be so gas rich that merg- 
ing preserve s disks and operates as a means of "clumpy ac- 
cretion" (e.g. [Robertson et al. 20063). 

Although the estimates differ at the highest redshifts, we 
stress that their integrated consequences at low redshifts z < 
3 are similar, as this is where most merging activity and 
spheroid/BH mass buildup occurs. We also note that high- 
redshift mergers are likely to be the most massive M ea i ^> M* 
systems, so we show our predictions for the clustering of 
mergers assuming different mass limits (as opposed to strictly 
at M ga i = M„). We also plot the mass flux in mergers, i.e. 
the integrated rate at which galaxy baryonic/stellar mass is 
merged, J M ga in(M ga i)dlogM ga i. This compares favorably 
with the observationally inferred rates at which mass is moved 
off the blue cloud, through the "green valley," and onto the 
red sequence (from the evolution in galaxy mass fu nctions 
and color-magnitude relations; see MartjnetaL|[2007), as ex- 
pected in a model where mergers drive such a transition (for 
details, see Paper II). Future observations of these quantities 
at high redshift will improve the constraints on our halo oc- 
cupation and merger timescale estimates, allowing for more 
accurate calculations of e.g. quasar triggering and spheroid 
formation rates at these epochs. 
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FIG . 12. — Top: As Figure[K)] but extending our predicted merger fractions 
to high redshift. Middle: Mass flux through mergers (i.e. total rate of stellar 
mass merging). Black points are observed merger fractions converted to an 
estimated mass flux rate following Hopkins et al. (2007a). Green, red, and 
blue circle show the observationally inferred mass flux through the "green 
valley" (i.e. from blue cloud to red sequence), rate of growth of the red se- 
quence, and rate of mass loss off the blue cloud (respectively), from z ~ 0— 1 
I Martin et al. 2007) (see Paper II for a more detailed comparison). Bottom: 
As Figure 1111 but extended to higher redshift. Blue and red lines show the 
clustering of mergers above the given mass thresholds. 

3.1. Consequences of Merger-Driven Fueling: What 
Determines Where and When Quasars Live 

Having developed in § [2] a physically-motivated model of 
merger rates as a function of galaxy and halo mass, environ- 
ment, and redshift (and tested that this model is consistent 
with the existing body of merger observations), we can now 
extend our application. As discussed in § Q] the argument 
for an association between mergers and quasars has a long 
history. We therefore make the simple ansatz: Every major 
merger of star-forming/gas- rich galaxies triggers a quasar. 
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FIG. 13. — Predicted quasar luminosity density, if quasars are triggered in mergers, as a function of redshift. Left: Prediction from a simplified toy model in 
which all halos hosting ~ L„ galaxies undergo major mergers near their characteristic small group mass scale, and build a BH which obeys the appropriate Mbh — 
Mh a i„ relation for that redshift (estimated Mbh ~^thn\o as a function of redshift from Hopkins et al. 2007d; Fine et al. 2006; Hopkins et al. 2007c, corresponding 
to solid, long dashe d, and dot-dashe d lines, respectively). Points show observational estimates from the measured QLFs of Ueda et al. (200 31 red circles), 
Hasinger et al. 12005, blue triangles), Richards et al. (2005, green diamonds), and the large compilation of multiwavelength QLF data in|Hopkins et al. j2007d 
black s tars). The observations from specific bands are converted to a bolometric luminosity density using the bolometric corrections calibrated in Hop kins et all 
1 2007e). Right: Same, but the predicted luminosity density is calculated properly accounting for all galaxy and halo masses from the merger rate functions 
determined in §|2] and adopting the observed ratio of BH to host galaxy spheroid mass as a function of redshift (e.g. Peng et al. 2006). Linestyles correspond to 
different means of estimating the exact merger rates, as in Figure|9] Red lines assume all mergers will trigger quasars, black (lower) lines assume only gas-rich 
("wet") mergers can trigger bright quasar activity (adopting the observed fraction of gas-rich/star-forming/blue galaxies as a function of M„ a ] and M(, a i as in 
Figure[5). A merger-driven model naturally predicts both the rise and fall of the global quasar luminosity density to high precision. 



From this statement, we can make a number of robust pre- 
dictions. In §|2] we derived the characteristic host halo mass 
for mergers of ~ M* galaxies. To the extent that these are gas- 
rich systems, this should therefore also represent the charac- 
teristic host halo mass of quasars, and (since the mass density 
of the Universe is dominated by systems near ~ M*) dominate 
the buildup of black hole mass. 

From the Soltan| (119821) argument, the black hole mass den- 
sity of the Universe must be dominated by growth in typi- 
cal, bright quasar phases with canonical radiative efficiency 
e r ~ 0.1. Let us construct the simplest possible model: merg- 
ers (of M.J, galaxies) characteristically occur at a host halo 
mass ~ M merger . From the halo mass function, it is straight- 
forward to calculate the rate at which halo mass crosses this 
mass threshold, 



Phalo = P - 



dt 



(4) 



where F(> Mhaio, z) is the fraction of mass in halos of mass 
greater than Mhaio, deter mined from t he Pre ss-Schechter for- 
malism revised following She th et ail (1200 ll) . Assume that ev- 
ery such halo undergoes a merger approximately upon cross- 
ing this mass threshold, which transforms its galaxy from disk 
to spheroid. The hosted BH mass therefore grows from some 
arbitrarily small amount to the expected mass given the BH- 
host mass relations, which we can write as Mbh = Kz)Miaio 
(we distinguish this from Mbh = /i(z)M ga i). The ratio v{z) is 
determined to z ~ 3 from the clusterin g of active BHs of a 
given mass at each redshift (see e.g., I da An gela et al] 120061 : 
iFine et aLll20Q6t iHop kins et al. 2007d c), and indi rectly from 
determ inations of the BH host galaxy masses dPeng et all 
2006). The total rate at which BH mass is built up is then 



dF(> M ha io) 

dt 



and the bolometric luminosity density is j D oi = Ej-Pbhc 2 . Fig- 
ure [l3]compares this simple estimate with the observed bolo- 
metric quasar luminosity density as a function of redshift. 

The agreement is striking, which suggests that this toy 
model, such that the bulk of the assembly of BH mass oc- 
curs near the transition halo mass, is reasonable. This also 
naturally explains the rise and fall of the quasar luminosity 
density with time. However, this is ultimately just a simple 
approximation - we can consider this in greater detail adopt- 
ing our previous estimate of the merger rate as a function of 
stellar mass and redshift, ri(M g . d \\z), from § [2] Each major 
merger transforms disks to spheroids, building a BH of aver- 
age mass Mbh = p{z)M g . d \. We should properly only consider 
mergers of gas-rich or star-forming systems, as dry mergers 
will, by definition, not be able to trigger quasar activity and 
form new BH mass. Therefore, we empirically adopt the frac- 
tion of red and blue galaxies at each M ga i, Mhaio (as in §[2j) to 
restrict only to mergers of blue galaxies. Again, n{z) ha s been 
directly determined from observations (Peng et al. 200 g), an d 
estimated from theoretical arguments (Hop kins et al.ll2007cl) . 
For conve nience, we adopt the n umerical best-fit estimate of 
/i(z) from Hop kins et al.l d2007d) . A good approximation to 
this numerical function is 



fi(z) » 0.0012 



(it 



l +z 5/2 



(z/1.775) 5 / 2 



(6) 



(5) 



which matche s the asymptotic ob s erved values at low and 
high redshift (lHaring & Rixl l2004t IWalter et al l 120041) . and 
captures the observed we ak evolution to z ~ 1 and rapid evo- 
lution between z = 1 - 3 (IShields et al.ll2003l:lPeng et al.ll2006h 
ISalviander et alj|2006h . Given the merger rate n(M ga i |z), we 
can then convert this to a cosmic rate of formation or build-up 
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FIG. 14. — Predicted BH mass f uncti on (BHMF) from gas-rich merger- 
driven quasar/BH formation (Figure [T"3l right). Results are shown at z = 
(black lines; linestyles correspond to different calculations of the merger 
rates, as in Figure[9), and z = 1, 2, 3 (blue, green, and red, respectively; for 
clarity, only our fiducial calculation - solid line - is shown, but relative evo- 
lution with redshift for each calculation is similar). Yellow (shaded) rang e 
shows the z, = observational estimate of the BHMF in Marconi et afj 420041) . 
Integrating forward the merger mass functions as a function of redshift yields 
a good match to the local BHMF. The effect of dry mergers is included, but 
is small. 

of BHs in merger-driven quasars, 

'UMbh \z) = J P(M BH | M gal )«(M ga i | z)dlogM gal . (7) 

The intrinsic dispersion about the mean BH-host mass rela- 
tion appears, at all redshifts, to be roughly lognormal with 
width w 0.27 dex, so we model P(Mbh 1-^gai) as such. Once 
the total rate of formation of BH mass is calculated, the same 
conversion above yields the quasar luminosity density. 

Figure [13] shows the results of this more detailed calcu- 
lation. They are similar to the results from our extremely 
simplified model - which reflects the fact that most of the 
mass/luminosity density is contained near M* or L„. Note 
that considering all mergers (i.e. including dry mergers) over- 
predicts the quasar luminosity density at low redshifts. This 
demonstrates that the decrease in the quasar luminosity den- 
sity at low redshifts is, in part, driven by the fact that an in- 
creasing fraction of massive systems have already been trans- 
formed to "red and dead" systems at late times, and are no 
longer available to fuel quasars, even if they undergo subse- 
quent dry mergers. By z ~ 0, for example, a large fraction 
(~ 50%) of the mass density in > M* systems has already 
been gas-exhausted (discussed in detail in Paper II), and there- 
fore such mergers are no lo nger a viable fuel supply for quasar 
activity. As discussed in § 12.21 the predicted gas-rich merger 
mass density (and corresponding quasar luminosity density) 
at z < 0.5 will be slightly lower if these gas-exhausted sys- 
tems are preferentially surrounding by gas-exhausted satel- 
lites (compared to gas-rich central galaxies of the same mass 
in similar halos), but it is clear in Figure [13] that this is com- 
pletely consistent with the observations (especially if secu- 
lar processes contribute significantly to the quasar luminosity 
density at low redshifts and luminosities, as we expect from 
our comparisons in § 13.3b . 

Having calculated the rate of BH formation as a function of 
the remnant BH mass, «(Mbh | z), it is trivial to integrate this 
forward and predict the BH mass function (BHMF) at any 
time. Figure [141 shows the result of this calculation at z = 0, 
compared to the observationally estimated BHMF. The two 
agree well at all masses, even at very large Mbh ~ 10 10 M Q . 
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FIG. 15. — Predicted quasar clustering as a function of red- 
shift, assuming merger-triggering (black lines, as in Figure [9), cor- 
responding to the small group scale of ~ M* galaxies. Red (up- 
per) shaded range show the prediction if quasars were associated with 
large group scales, blue (lower) range show the prediction from a sec- 
ular model in which quasar clustering traces that of star-forming galax- 
ies observed at each redshi ft (lines show dz 1 a range estimat ed from 
the compiled observation s in Hopkins et al. 1 2007d), from Shepherfe^aU 
(200l|) ; IGiavalisco & Dickinson 12001); Norber g"etaT| 420021); ICoil et all 
120041); Zehavi et a l.1 <2005T); lA delberger et al. 12005); Allen et alj 120051) ; 
IPhleps et aiT 72006); Men eux et alj 420061); ILee et 311)20061) ). Points show 
quasar clustering measur ements from ICroom et al.1 420051 red squares), 
Porciani & Norberg 12006, green diamonds), Mvers et al. (2006a c, cyan and 
blue circles), and da Angela et al. (2006, violet stars). Large black stars 
show the observed clustering of z ~ 1 small groups (of ~ L* galaxies) from 
ICoil et akl 420061) . corresponding to the most efficient scales for major ~ L, 
galaxy mergers. Quasar clustering measurements are consistent with the 
small group scale in which mergers proceed efficiently. 

We also show the BHMF at several other redshifts. Interest- 
ingly, there is a downsizing behavior, where a large fraction 
of the most massive BHs are in place by z = 2, while less 
massive BHs form later (essentially required by the fact that 
few ~ 10 9 M Q BHs are active at l ow redshift, while a very 
high fraction are active at z ~ 2, seelMcLure & Dunlopll2004t 
Kollmeier et al. 20061 iFine et alll2006h . If we were to ignore 
dry mergers at low redshifts, this effect would be even more 
pronounced, but at z < 1 their effect is to move some of the 
BH mass density from lower-mass systems into higher mass 
> 10 9 M Q systems (at higher redshifts, the effects are negligi- 
ble). It is not obvious, however, that this translates to downsiz- 
ing in galaxy mass assembly, since the ratio of BH to galaxy 
mass /i(z) evolves with redshift. We will return to this ques- 
tion in Paper II. 

Since we begin our calculation with the halos hosting 
quasars, we should be able to predict the bias of quasars as 
a function of redshift. As in Figure QT] we use the known 
clustering of the halos hosting mergers to calculate the clus- 
tering of those mergers as a function of redshift. Assuming 
each merger produces a quasar of the appropriate mass, this 
yields the expected clustering of quasars as a function of red- 
shift. Figure [15] compares this prediction to observed quasar 
clustering as a function of redshi ft. Te chnically, we adopt the 
quasar lightcurve models from § !3.2l below to determine the 
clustering specifically of L» quasars (i.e. determining the rel- 
ative contribution to from different host masses and their 
clustering as in Figure ITTV but the result is nearly identical to 
assuming that L* quasars trace M* mergers (Figure ITTb. This 
should be true in any model, as long as the quasar lifetime is a 
smooth function of luminosity or host mass. We also compare 
with the directly observed clustering of small groups similar 
to our definition. 
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FIG. 16. — Top: Characteristic halo mass implied by quasar clustering mea- 
surements. Points show the \a allowed range in host hal o mass M(, a i corre- 
sponding to the quasar bias measurements in Figure [T5l (in the same style). 
Shaded magenta regions show t he range o f halo masses for the correspond- 
ing redshift bins in the SDSS I Shen et al. 2007). The solid line shows the 
best-fit MhaioCz) to all observations, with the \a (2<r) allowed range shaded 
orange (cyan). Middle: Shaded range again shows the characteristic host halo 
mass implied by quasar clustering. Points show the halo mass scale implied 
by direct measurements of observationally identified small groups (velocity 
dispersions < 200 kms~'), from Brough et al. (2006) at z ~ (squares), and 
from from clustering measurements of groups from Eke et al. 1 2004], trian- 
gles) and Coil et al. ( 2006, stars). Bottom: Same, but showing the small group 
halo mass estimated indirectly from the empirically determined halo occupa- 
tion distribution (HO D). Black inverted triangles adopt the best-fit HOD from 
IConrov e t alj 120 09) (our defa ult model), other points adopt the methodol- 
ogy of [Vale & Ostriker ( 2006) t o construct the HOD from vario us measured 
galaxy stellar mass func tions inj Fontana et al. (2006, blue stars), B orch et all 
12006, purple squares), Bundy et al. (2005, 2006, red circles), and Blanton 
12006, orange triangles). The characteristic scale of ~ L* quasar hosts ap- 
pears to robustly trace the characteristic small group scale of ~ L* galaxies; 
i.e. the mass scale at which galaxy mergers are most efficient. 

The agreement is quite good at all z < 2. At higher red- 
shifts, the observations show considerably larger scatter, per- 
haps owing to their no longer being complete near the QLF 
L* - future observations, sufficiently deep to clearly resolve 
L*, are needed to test this in greater detail. We also consider 
the predicted clustering if L» quasars were associated with the 
large group scale of M* galaxies (for simplicity we take this 
to be halo masses > 5 — 10 times larger than the small group 
scale, where our halo occupation model predicts of order > 3 
satellite ~ M* galaxies), and the expectation from a secular 
model, in which quasar clustering traces the observed clus- 
tering of sta r-forming galaxies (ta ken from the observations 
collected in Hopkins et al. 2007d) - neither agrees with the 
observations. Note that these estimates may not be applicable 
to the highest-redshift quasar clustering measurements, where 
flux limits allow only the most massive L>L, systems to be 
observed (but see Figure Q~2] for how the clustering amplitude 
varies with merger masses). 



We can invert this, and compare the empirically determined 
scales of quasar host systems with the small group scale which 
should dominate gas-rich ~ L» galaxy mergers. Figure [16] 
shows the mean host mass Mhaio which corresponds to var- 
ious quasar clustering measurements (i.e. range of Mhaio for 
which the expected quasar bias agrees with the observed ± 1 a 
range). We compare this with direct measurements of the 
halo masses corresponding to small groups of ~ M* galaxies, 
determined from both clustering measurements and velocity 
dispersion measurements of observationally identified groups 
with dispersions a < 200 km s . We can also estimate the 
appropriate small group scale from the halo occupation for- 
malism. 

Spe cifically, following the formalism of IConrov et alj 
(2006), if galaxy luminosity /mass is monotonic with subhalo 
mass (at the time of subhalo accretion), then we can take any 
galaxy mass function, monotonically rank it and match to our 
halo+subhalo mass functions, and obtain a new halo occupa- 
tion model which predicts a small group scale - i.e. the range 
of halo masses at whi ch satellites of mass ~ L* first appear. 
As discussed in § 12.11 the choice of mass functions and how 
the HOD is constructed makes little difference (factor < 2) to 
our predictions, so (unsurprisingly) these all yield a similar 
estimate of the small group scale to our default model predic- 
tions. 

At all observed redshifts, the scale of ~ L* quasars appears 
to trace the small group scale - i.e. whatever mechanism trig- 
gers ~ quasars operates preferentially at the characteristic 
small group scale for ~ L* galaxies, where mergers are ex- 
pected to be most efficient. 

In § 12.31 we demonstrated that the increased probability 
of mergers in regions with excess small scale overdensities 
means that the typical merger is more likely to exhibit an ex- 
cess of clustering on small scales, relative to average systems 
of the same halo mass. If quasars are triggered in mergers, 
this should be true as well. We therefore apply the identical 
methodology from Figure [8] to calculate the excess clustering 
signal expected in active quasars. Figure [17] shows the re- 
sults of this exerc ise. We adopt the lar ge-scale mean cluster- 
ing expected from Myers et al. (2006b), specifically using the 
formulae of ISmith et al.l (120031) to model the expected non- 
linear correlation function in the absence of any bias, then 
apply the formalism from Figure [8] to estimate the additional 
bias as a function of scale. Comparing this to observations, 
the measurements clear ly favor an excess bia s on small scales 
(r < 100- 200/;"' kpc: iHennawi et al.l 120061) . similar to our 
prediction, over a constant bias at all scales. This appears to 
be true at all observed redshifts; the excess relative bias we 
predict at small scales is simply a consequence of how the 
probability of a merger scales with local density, so it does 
not vary substantially as a function of redshift. 

It should be noted that the excess of quasar clustering on 
small scales might also reflect an excess of merging binary 
quasars, i.e. merging systems in which the interaction has 
triggered quasars in each merging counterpart. For the rea- 
sons given in § [T] this situation is expected to be relatively 
rare (ev en if all quasars are in itially triggered by galaxy merg- 
ers), but M yers et all (l2006bl) note that only a small fraction of 
merging pairs need to excite quasar activity in both members 
in order to explain the observed clustering excess. Figure [171 
demonstrates that a similar excess is observed in both the 
quasar-quasar autoc orrelation function (IHennawi et al J 120061 
iMvers et"aL 2006b) a nd the quasar-galaxy cross-correlation 
function "(ISerberetalJ l2006). arguing that it primarily reflects 
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Eddington ratio me asurements ( Mc Lure & Dunlod l2004t 
i Kollmeier et alj2006b active BH mass functions dVeste rgaard 
l2004tlFine et al.ll2006tlGreene & Hd2007l) , an d cosmic back- 
ground measurements (IVolonteri et alJ l2006t iHopkins et alJ 
I2007eh . 

The quasar luminosity function <f>(L) is given by the convo- 
lution over the merger rate (rate of formation of BHs of final 
mass Mbh in mergers) and quasar lifetime (differential time 
spent at luminosity L by a BH of final mass Mbh): 



<p(L)= / f G (L|MBH)«(MBH|z)dlogM B H. 



(8) 



FIG. 17. — Exce ss small-scale clus tering of quasa rs expected if they are triggered in mergers, as in Figure [8] Left: Observed correlation functions from 
IMvers et all (2006b, blue circles) and Hennawi et al. 12006, green diamonds), me asured for ~ L* qu asars over the redshift ranges z. ~ 1-2. Dashed black 
line shows the expected correlation function (nonlinear dark matter clustering from Smith et al. ( 2003), multiplied by the appr opriate constant large-scale bias 
factor) without a small-scale excess. Red lines multiply this by the predicted additional bias as a function of scale from § 12.31 namely the fact that small-scale 
overdensities increase the probability of mergers. Solid line shows our mean prediction, dashed the approximate ~ 1<t range, as in Figure[8] Center: Same, but 
dividing out the best-fit large-scale correlation function (i.e. bias as a function of scale). Black squares in upper panel show the measurement for true optical 
quasars (-23.3 > M, > -24.2) from lSerber et all 120061) at z. ~ 0.1-0.5. Right: Ratio of the mean bias at small radii (r < 100 /T 1 kpc) to that at large radii 
(the asymptotic values in the center panel), at all redshifts where this has been observed. Lines show the predicted excess from the previous panels (lower line 
averages down to a minimum radius r = 50/j kpc, upper line to a - potentially unphysical - minimum r = 10/T 1 kpc). 

a genuine preference for quasar activity in small-scale over- 
densities. In any case, however, the excess on small scales 
is a general feature of a merger-driven model for quasar ac- 
tivity. Indeed, the predicted exces s is also seen in hi gh- 
resolution cosmological simulations (Thacker et al. 2006), if 
quasars are specifically identified with ("attached to") major 
mergers. Secular (bar or disk instability) fueling mechanisms, 
on the other hand, should (by definition) show no clustering 
excess relative to median disk galaxies of the same mass and 
properties, in contrast to what is observed (although in agree- 
ment with what is seen for low-luminosity Seyfert galaxies, 
see §13.31). 

3.2. Model-Dependent Predictions: Additional 
Consequences of Quasar Light Curves 

To proceed further, we must adopt some estimates 
for quasar lightcurves an d/or lifetimes. Fol l owin g the 
methodology develope d by ISpringel & Hernquistl d2003l) and 
Springel et al. (2005b), Hop kins et all (l2006allbl) use a large 
set of several hundre d hydrodynamical simulations (see 
iRobertson et alj|2006bl) of galaxy mergers, varying the rele- 
vant physics, galaxy properties, orbits, and system masses, 
to quantify the quasar lifetime (and related statistics) as a 
function of the quasar luminosity. They define the quan- 
tity /,2(L|Mbh), i.e. the time a quasar of a given BH mass 
Mbh (equivalently, peak quasar luminosity L pea k) will be ob- 
served at a given luminosity L. They further demonstrate 
that this quantity is robust across the wide range of var- 
ied physics and merger properties; for example, to the ex- 
tent that the final BH mass is the same, any major merger 
of sufficient mass ratio (less than ~ 3 : 1) will produce 
an identical effect. We adopt these estimates in what fol- 
lows, and note that while there is still considerable uncer- 
tainty in a purely empirical determination of the quasar life- 
time, the model lightcurves are consistent with the present 
obser vational constraints from variability stud ies (|Martinil 



2004 



2005 



and references th erein) clustering (ICroometalJ 
Adel berger & Steide]||2005fc iPorciani & Norberdl200a 



Myers et aljr2006at Ida Angela et all 120061: IShen et al.ll2007l) . 



Note this technically assumes fg <C ?h, but this is true for all 
luminosities and redshifts of interest here. Figure [18] shows 
this prediction at a number of redshifts, comp ared to the 
large c ompilation of QLF measurements from Hopki nTet alJ 
(2007e). The agreement is surprisingly good at all redshifts. 
At the most extreme luminosities Lboi > 3 x 1O 14 L0 at each 
redshift, our predictions may begin to fall short of the ob- 
served QLF, but this somewhat expected, as these luminosi- 
ties naively imply > 10 10 M Q BHs accreting at the Edding- 
ton limit. It is therefore likely that a full resolution at the 
most extreme luminosities involves either revising the esti- 
mate of these bolometric luminosities (i.e. the bolometric cor- 
rections adopted may not be appropriate for the most ex- 
treme objects, or there may be beaming effects) or includ- 
ing processes beyond the scope of our current investigation 
(e.g. super-Eddington accretion or multiple mergers in mas- 
sive BCGs). Nevertheless, our simple merger-driven scenario 
appears to accurately predict the distribution and evolution of 
most quasar activity. 

Integrating the QLF over the appropriate range, we triv- 
ially obtain the active fraction, and can calculate this sepa- 
rately for each host mass M ga i or BH mass Mbh in Equa- 
tion (8). Figure [19] compares this to observations at both 
low and high redshift, for systems with m = L/L&&& > 0.1, 
representative of typical Seyfert and quasar populations (e.g. 
iMcLur e & Dunlop 2004). Note that the quasar lifetime in- 
tegrated above this threshold is close to a constant value 
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FIG. 18. — Predicted quasar lum inosity functions, convolving our predicted merger rate functions (Figure [9] same line styles) with quasar lightcurves from 
simulations (Hopkins et al. 2006a). Red lines allow dry mergers to trigger quasar activity as well (leading to an overesti mate at low reds hifts, as in Figure [\3\ . 
Points show observed bolometric luminosity functions at each redshift, from the compilation of observations in Hopkins et al. (2()07e). QLF measurements 
derived from observations in the optical, soft X-ray, hard X-ray, mid-IR, and narrow emission lines are shown as green, blue, red, cyan, and orange points, 
respectively. The merger-driven model naturally predicts the observed shape and evolution of the QLF at all redshifts. 

volving over the contributions to each merging range in Mbh 
from all Mhaio- Figure [20] plots the expected bias as a func- 
tion of luminosity a t each of several redshifts. As origi- 
nally demonstrated in lLidz et al.l (12006). our model for quasar 
lightcurves and the underlying triggering rate of quasars pre- 
dicts a relatively weak dependence of clustering on quasar lu- 
minosity. Here, we essentially re-derive this result with an 
a priori prediction of these triggering rates, as opposed to 
the pu rely empirical (fitted to the QLF) rates from lLidz et al.1 
(2006), and find that the conclusion is robust. However, 
this prediction is not necessarily a consequence of merger- 
driven models (nor is it unique to them) - w e show the pre- 
diction s fr om the semi-analytic m o dels o f IWvi the & Loeb 
d2002l) and iKauffmann & Haehneltl (120001) , who adopt sim- 
plified "lightbulb"-like quasar ligh tcurves (for a detai led dis- 



< 10 8 yr, similar to observational estimates (Martini 2004). 
At very low masses/levels of activity, other fueling mecha- 
nisms may be d ominant - for co mparison with e.g. the ac- 
tive fractions in lHao et all (120051) of typical < 1O 7 M BHs 
(< 10 1() Mq hosts), we refer to secular and/or "stochastic " ac- 
cretion model s in disks (e.g.lHopkins & Hern auist 2006|) and 
old ellipticals dSimoes Lopes et alT2 007). Furthermore, at the 
lowest masses plotted, the typical AGN luminosities become 
extremely faint (typical M B > -18 in M g . di < 10 10 M Q hosts), 
and so such systems may be more oft en classified as non-AGN 
or typical star-forming systems (e.g.[Rodighiero et alj|2007h . 
At high levels of accretion, however, the merger-driven pre- 
diction agrees well with observations at low and high red- 
shift, and predicts a downsizing trend similar to that seen - 
namely that from z = 2 to z = 0, quasar activity has been par- 
ticularly suppressed in the most massive systems (although it 
has been suppressed to some extent at all host masses), pre- 
sumably owing to the conversion of these systems to "red and 
dead" spheroids without cold gas su pplies (see Paper II). 

We next follow ILldz et al.l (|2006), and extend Equation (O 
to convolve over the expected bias of the active systems at 
each quasar luminosity L, 



Hopkins et al. 2007d). 



b(L): 



1 



b(M BU )t (L\M BH )ri(M BH \z)dlogM Bll , (9) 



where b(M B u) is determined just as b(M g . d y) in § 12.41 by con 



cussion of these differences, seeL 

The reason for the weak dependence of quasar clustering 
on luminosity in Figure |20]is, in fact, the nature of the quasar 
lightcurve. Quasars grow rapidly in mergers to a peak quasar 
phase at the final stages of the merger, which exhausts and 
expels the remaining gas, after which the quasar decays to 
lower luminosities. This decay moves objects of the same 
host properties to fainter luminosities in the QLF, making the 
clustering properties flat as a function of luminosity. Thus, 
while an important test of our modeling (that the correct halos 
and galaxies host quasars of the appropriate luminosities), this 
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FIG. 19. — Predicted AGN fraction as a function of host properties. Top: 
Low-redshift quasar fraction (defined here by Eddington ratios m > 0.1) as 
a function of galaxy mass. Black lines show the prediction of our merger- 
driven model, in the style of Figure [9] Ob served fractions are sho wn down 
to (roughly) their completeness limit, from Kauffmann et al. 1 2003). Bottom: 
Same, but at z m 2, with the AGN fraction determine d observationally in LBG 
)Erb et aj]|2006D and S"-selected I Kriek et al. 2006) samples. Some caution 
should be applied at Mmi < 10 i0 Mq, as the AGN luminosities become suf- 
ficiently low that even moderate star formation will dominate the observed 
luminosity and systems may not be classified as AGN. 

is not a unique prediction of merger-driven models. 

We can also use our model to estimate the infrared lumi- 
nosity functions of various populations versus redshift. By 
construction, our assumed halo occupation model reproduces 
the observed star-forming (blue) galaxy mass function at each 
redshift. Using the corresponding fitted star-formation histo- 
ries as a function of baryonic mass from lNoeske et alj d2007l) 
(which fit the observations locally and their evolution at least 
to z ~ 1 -5), we immediately obtain an estimate of the star for- 
mation rate function in "quiescent" (non-merging) galaxies at 
each redshift. We include a scatter of ~ 0.25 dex in SFR at 
fixed stellar mass, comparable to that observed (in blue galax- 
ies), but this makes relatively little difference, as the most ex- 
treme SFR populations are domin ated by mergers. We then 
adopt the standard conversion from Ken nicuta d 19981) to trans- 
form this to an infrared luminosity function (where we refer 
to the total IR 8 - 1000 /im luminosity). 

Our model also yields the mass function of gas-rich merg- 
ers, for which we can estimate their d istribution of star for- 

1 (120061 . 



mation rates. In Hopkins et al. (2006f), we quantify the dis- 
tribution of star formation rates as a function of galaxy prop- 
erties from the same large suite of simulations used to esti- 
mate the quasar lifetime. Essentially, this quantifies the "life- 
time" above a given SFR in a merger, which can be reasonably 
approximated as a simple function of galaxy mass and (pre- 



f(>M*) = f*r 0, 



My/gas/?*' 1 (l0) 

where Mf is the post-merger galaxy mass (i.e. our M ga i) and 
f* k 0.3 Gyr is a fitted characteristic time. This functional 
form simply amounts to the statement that there is a mean 
characteristic timescale f* in which most of the gas mass of the 
merger (My/ gas ) is converted into stars, which we find is (un- 
surprisingly) similar to the dynamical time of the merger and 
to observational estimates of the charac teristic star fo rmation 
timescale in starbursts and ULIRGs (Kennicutt 1998). Since 
the fitted star-formation histories of iNoeske et alJ (120071) im- 
plicitly define a gas fraction as a function of time (or can be 
used in combination with the Schmidt-Kennicutt star forma- 
tion law to infer the gas fraction), we simply adopt these for 
the pre-merger galaxies (but we have checked that they cor- 
rectly reproduce o bserved gas fractions as a function of mass 
at z = 0, 1, 2; see lHopkinsetaT]|2007ch . It is worth noting 
that, with this estimate, the explicit dependence on / gas can be 
completely factored out in Equation ( fTOl . and we can write it 
as an estimate of the amount of time a system spends above a 
given enhancement in SFR (basically a merger enhances the 
r-model SFR by ~ t/**), relative to the pre-merger SFR. Us- 
ing the same SFR to Lir conversion, we obtain a rough esti- 
mate of the IR luminosity function of mergers. 

Finally, adopting the empirically calcula ted obscured frac - 
tion as a function of quasar luminosity from lGilh et alJd2007l) . 
and assuming that the obscured bolometric luminosity is re- 
radiated in the IR, we convert our predicted bolometric QLF 
to an IR QLF of obscured quasars. Technically, not all of 
the luminosity will be obscured, of course, but we find that 
e.g. using the full distribution o f column densities as a func- 
tion of quasar luminosity from lUeda et all (120031) to attenu- 
ate a template AGN SEP yi elds a very similar answer (see 
also lFranceschini et al.ll2.Q05h , as does using a mean X-ray to 
IR bolometric correction of obscured AGN (lElvis et "aTll 1 994t 
IZakamska etai]|2004 IPolletta et alJl2006l) . Including the IR 
contribution from un-obscured quasars is a negligible correc- 
tion. 

Figure |2T| compares the resulting predicted IR luminosity 
functions to observations at z = - 2, and to the observed 
IR luminosity density from z ~ 0-5. At all redshifts, the 
agreement is good, which suggests that our model accurately 
describes the star-formation history of the Universe. This 
should be guaranteed, since at all redshifts the quiescent pop- 
ulation dominates the ~ optical and IR luminosity func- 
tions (hence also the star formation rate and IR luminosity 
densities) - at this level, we simply confirm that our halo oc- 
cupation model is a good approximation. However, at high 
luminosities, typical of ULIRGs, the populations are gener- 
ally dominated by mergers and (at the highest luminosities) 
obscured AGN. 

We explicitly quantify the transition point as a function of 
redshift in Figure [22] (we show the comparison there just for 
our "default" model, but as is clear in Figure [2T] the transi- 
tion between different populations dominating the LF is sim- 
ilar regardless of the exact version of our model adopted). 
Our comparisons generally affirm the conventional wisdom: 
at low redshift, mergers dominate the ULIRG and much of the 
LIRG populations, above a luminosity ~ 1O 1L4 L0, with heav- 
ily obscured (potentially Compton-thick) AGN (in starburst 
nuclei) becoming a substantial contributor to IR luminous 
populations in the most extreme > a few x 1O 12 L systems 
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FIG. 20. — Left: Predicted bias as a fun ction of quasar lumino sity from our merger-driven model (black lines, style as in Figure|9j. To contrast, the expected 
bias b(L) from the semi-analytic models of Wyithe & Loeb 1 2002, cyan) and Kauffmann & Haehnelt 1 2000, orange with diamonds) are plotted (dot-dashed lines); 
these a dopt simplified ( constant or exponential "on/o ff") quasar lightcur ves. Points a r e measu rements fromlCroomet^al . 1 2005, red squares), Adelberger & Steidell 
1 2005, orange crosses), [Porciani & Norberg 1 2006, purple diamonds), Myers et al. 1 2006a, blue circles), da Angela et al. (2006, magen ta stars), andlCoil et all 
12007, black open circles). For ease of comparison, all luminosities are converted t o bolom etric luminosities using the corrections from Hopkins et al. 1 2007e). 
Vertical blue dotted lines show L* in the QLF at each redshift, from Hopkins et al. 1 2007e). Right: The best-fit slope of the dependence of bias on luminosity at 
the QLF L t , i.e. d(fc/£>«)/dlog ( L/L<,) , where b* = b(L,) . Points are determined from the observations at left, with the observations from Myers et al. 1 2006a, 
cyan circles) and Grazian et al. 1 2004); Wake et al. 1 2004, black open diamond) added. Lines are in the style of the left panel, with the red d ashed line showin g 
no dependence of bias on luminosity. Adopting an a priori model for merger-triggered quasar activity reproduces the empirical prediction from Lidz et al. 1 2006), 
that quasar bias should depend weakly on quasar luminosity. 
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FIG. 21 . — Left: Predicted total IR (8— 1000 fim) luminosity functions at different redshifts (as labeled). Green, blue, and red lines shows the estimated contribu- 
tion from non-merging systems, star formation in merge rs, and obscured AG N in mergers, res pectively. Linestyles a r e as in Figure|9] for th e variants of the merger 
calculations. Points show observational estimates from Saunders et al. 1 1990, magenta stars), Soifer & Neugebauer 1 1991, blue triangles), Yun et al. 12001, black 
circles), Le Floc'h et al. 1 2005, black diamonds), Chapman et al. 1 2005, black inverted triangles), Babbedge et al. (2006, black squares), and Caputi et al. 1 2007, 
black X 's). Right: Integrated IR luminosity density. Solid lines show the contributions from non-merging systems (green), star formation in mergers (blue), and 
obscured quasars in mergers (red). Blue dotted shows the total (star formation +AGN) merger contribution, black dashed shows the total from all sources. Orange 
points show ob s erva tional estimat es of pir from the compilat ion of Hopkins 1 2004, circles; only the direct IR observations therein are plotted here), as well as 
ILe Floc'h etaf] f20Ol diamonds), |P erez-Gonzalez et al. 12005), and Caputi et al. l 2007|, x's). Red stars show the bolometric quasar luminosity density from 
Figurefl3] rescaled by a constant (mean) obscured-to-unobscured ratio of ~ 2 : 1. The agreement in all cases is good - our model reproduces the star formation 
history of the Universe and distribution of star formation rates and bolometric luminosities. 



24 



Hopkins et al. 



Mergers Dominate 



13.5 








'Quiescent Galaxies Dominate 

1 2 3 

z 



(Obscured) AGN Dominate'. 




4 



Star Formation Dominates 

1 2 3 

z 



5F 



Fig. 22. — Left: Total IR luminosity, as a function of redshift, above 
which mergers (star formatio n+A GN) dominate the total IR luminosity func- 
tions (solid line, from Figure |2T1 dashed lines show the range above which 
25/75% of systems on the luminosity function are mergers). Point shows 
the corr esponding transitio n point (and range) observed in low-redshift sys- 
tems {Sanders et al. 1988b). Right: Same, but for the transition between star 
formation (in non-merging+merging systems) and (obscured) AGN dominat- 
ing the IR luminosity functions (generally a factor ~ a few larger luminosity 
than the quiescent system-merger transition). Poi nts show the obser ved esti- 
mates from comparison of PAH feature strengths in lLutz et alj fl998, low red- 
shift) and Saiina et al. 12007, high redshift). A similar estimate is obtained (at 
low redshift) from comparison of emission line strengths I Sanders & Mirabel 
rn^lKewlev et aljUOOl) . full SED template fitting JFarrah et al.H2003l), or 
indire ct comparison with Type 2 AGN luminosity functions I Chary & Elbaz 
1200 ID . The model predicts the local transitions, and that by z > 1, the LIRG 
population is dominated by quiescent star formation in gas-rich systems (even 
as the total and fractional luminosity density in mergers increases rapidly). 



(nearing hyper-LIRG > 1O 13 L0 luminosities which are com- 
mon bolometric luminosities for > 10 8 M© BHs near Edding- 
ton, but would imply potentially unphysical > 1000 M yr -1 
SFRs). At higher redshifts, disks are more gas-rich, and thus 
have characteristically larger star formation rates, dominating 
the IR LFs at higher luminosities. By z ~ 1, most LIRGs 
are quiescent systems, and by z ~ 2, only extreme systems 
> a few x 1O 12 L0 are predominantly mergers/AGN. 

This appears to agree well with recent estimates of the tran- 
sition between AGN and passive star formation dominating 
the bolometric luminosities of high-redshift systems. Inter- 
estingly, this shift occurs even while increasing merger rates 
(and higher gas fractions in typical mergers) lead to a larger 
overall contribution of mergers to the star formation rate and 
IR luminosity densities. At z ~ 0, mergers contribute neg- 
ligibly to the total IR luminosity density, but by z ~ 2, they 
may contribute ~ 20-50% of the IR output of the Universe, 
with that contribution owing comparably to both star forma- 
tion in mergers and obscured BH growth (which should be 
true, given the MBH-Mh os t c orrelations and typ ical e r ~ 0.1 
radiative efficiencies; see, e.g. iLidz et al.ll2007ah . 

The integrated contribution of mergers to the star forma- 
tion rate and IR luminosity densities agrees well with obser- 
vational estimates (a vailable at z < 2; see Bell et al.i 120051: 
iMenanteau et al.l [2.0061) . and the constraint from stellar pop- 
ulation models that only a small fraction of the z = stel- 
lar mass in typical early-type galaxies was formed in the 
spheroid-forming merger itself (as opposed to more extended 
star f ormation in the pre-merger disks; e.g. Noes ke et al.1 
l2007h . For a more detailed comparison and analysis of the 
merger-induced contributio n to the star formation rate density 
of the Universe, we refer to Hop kins et al.1 (l2.Q06fh . 

We caution that the above comparisons are approximate, 
and intended as a broad check that our models are consistent 
with the observed abundance of IR luminous galaxies as a 



• Quasars (-23.3 > M, > -24.2) 
□ Seyferts (-22.0 > Mi > -23.3) 




0.01 



0.10 

[ h 1 Mpc ] 



1.00 



FIG. 23. — As Figure [171 (upper center panel), but comparing the clus- 
tering (quasar-galaxy cross-correlation) as a function of scale measured by 
ISerber et alj (2006) for bright optical quasars and dimmer Seyfert galaxies. 
Quasar clustering is consistent with our predicted excess on small scales, in- 
dicating a merger-driven origin, but low-luminosity systems show no such 
dependence, suggesting that processes independent of the local, small-scale 
density (e.g. secular processes) may dominate at these luminosities. 

function of redshift. We have ignored a number of potentially 
important effects: for example, obscuration is a strong func- 
tion of time in a merger, and may affect various luminosities 
and morphological stages differently. Moreover, our simple 
linear addition of the star formation contribution of mergers 
to the IR LF and the AGN contribution is only technically cor- 
rect if one or the other dominates the IR luminosity at a given 
time in the merger; however, there are clearly times during the 
final merger stages when the contributions are comparable. 
Resolving these issues requires detailed, time-dependent ra- 
diative transfer solutions through high-resolution simulations 
that properly sample the merger and quiescent galaxy param- 
eter space at each redshift, and is outside the scope of this 
work (although an important s ubject for future, more detailed 
study; see, e.g. iLi et aD, l2007). It would be a mistake, there- 
fore, to read too much into e.g. the detailed predictions for 
sub-millimeter galaxies or other extreme populations based 
on Figures [2T] & [22] However, most of our predicted qualita- 
tive trends, including the evolution of the luminosity density 
(and approximate relative contribution of mergers) and the 
shift in where quiescent or merger-driven populations dom- 
inate the bright IR LF, should be robust. Critically, a model in 
which merger-driven quasar activity dominates the QLF pre- 
dicts an abundance of IR-luminous galaxies consistent with 
the observations as a function of both luminosity and redshift. 

3.3. When Merger-Triggering Loses to Secular Processes 

Despite these arguments for a merger-driven origin for 
bright, high-redshift quasars, there are good reasons to be- 
lieve that most local, high-Eddington ratio objects are not re- 
lated to mergers. Most active local systems t ypically involve 
relativ ely low-mass BHs (Mbh ~ 10 7 M©; iHeckman et alJ 
2004), in Sa/b-type host galaxies, without significant evi- 
dence for recen t major interactions ( Kauff mann et al J 120031 ; 
iPierce et"ai1l2006l) . and have relatively low S eyfert-level lu- 
minosities (-21 >M B > -23: lHaoet alJl2005[) . below the tra- 
ditional Mg = —23 Seyfert-quasar divide. Given this, it is nat- 
ural to ask whether there are additional reasons to believe that 
bright quasars have distinct origins, and if so, when (or at 
what luminosities) these non-merger driven fueling mecha- 
nisms begin to dominate AGN populations. 
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In addition to the arguments in § 13.11 & 13.21 there 
are a number of qualitative differences between bright, 
high-redshift quasars and local Seyferts. Quasars have 
signific antly different clustering amplitudes (Hopki ns et alJ 
2007dl) and host stellar mass distributions ( Hopki ns et al.1 
2007a) from star-forming galaxies at z > 1, and typically 



have hosts with elliptical or merger remnant morphologies 
dFlovd et al.ll2004t [Falomo et al Jl2004t fZakamska et alteOOfl 
Letaw eetalT 20061). frequently exhibiting evidence of tidal 
disturbances (iBahcall et al.l|1997l:IC analizo & Stocktonll2001l: 
Hutchings et al.l2003ll2006l:IUrrutia et al.l2007l;lBennert et al] 
20071) . Figure [23] c ompares the cl ustering as a function of 
scale measured in lSerber et al.1 (120061) for both bright quasars 
and Seyfert galaxies - quasars exhibit the strong trend of 
excess clustering on small scales indicative of a triggering 
process which prefers small-scale overdensities, but Seyferts 
show no significant preference for local overdensities. 

Because galaxy mergers are also associated with the ter- 
mination of star formation in the remnant (even if only tem- 
porarily), i.e. a rapid post-starburst phase and transition to the 
red sequence (discussed in detail in Paper II), the decay of the 
quasar lightcurve should be associated with the reddening of 
the remnant, in a merger-driven model. This implies a par- 
ticular preferred track for quasar hosts in the color-magnitude 
diagram, illustrated in Figure [24] In this scenario, quasars 
should be associated with the crossing of the "green valley" - 
i.e. the triggering of a quasar occurs at the end of the merger, 
when young stellar populations imply a bluer-than-average 
host spheroid, and the quasar decays to lower luminosities as 
the remnant reddens onto the red sequence. 

Alternatively, if quasars were triggered in a purely secu- 
lar manner, or otherwise independent of whatever quenching 
mechanism terminates the galactic supply of cold gas, then 
their natural preferred location is in the blue cloud - i.e. blue- 
ward of the "green valley." Systems in this regime still have 
cold gas supplies and have not yet quenched. Because the 
quenching is uncorrected with quasar triggering in such a 
model, and the lack of galaxies in the "green valley" implies 
that this transition is rapid, very few quasars would be ex- 
pected to be triggered just as the quenching occurs, and there- 
fore few quasars should be present in the "green valley." 

Comparing these qualitative scenarios with observations 
appears to favor the former, merger-driven case. Quasars tend 
to live redwards of the "top" of the blue cloud, with the bright- 
est/highest accretion rate quasars preferentia lly in bluer-than- 
average spheroids in the "green valley" {K auffma nn et alJ 
2003; Sanche zlFaIll2004l:lNandra et al.ll2006l) . 

Figure [25] shows this quantitatively - we plot the distribu- 
tion of colors of quasar hosts, compared with that fitted to the 
blue cloud and red sequence, or systems with observed bars 
and/or disk instabilities (the expected quasar hosts in a secular 
model, regardless of quasar duty cycles during a bar phase), 
and post-starburst (E+A/K+A) systems, largely identified as 
merger remnants and "blue spheroids" (see the discussion in 
§ |2.31 l. The quasar hosts clearly lie preferentially between the 
blue cloud and red sequence, with a color distribution very 
similar to observed post-starburst galaxies. 

The distribution is quite distinct, however, from observed 
barred systems, which lie overwhelmingly on the blue se- 
quence with, if anything, a bias towards the bluest systems 
(which is expected, as these are the most gas-rich and there- 
fore most unstable systems). Even if one assumes that, in the 
most extreme bar instabilities, dust reddening might move the 
system into the "green valley" as a reddened disk, this ap- 
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FIG. 24. — Location of quasars in the color-magnitude diagram, expected 
from different models. Top: Red and blue dotted regions roughly outline 
the red sequence and bl ue cloud, respec tively, with the dashed line divid- 
ing the bimodality (from Bell et al. 2004). Arrows show the preferred loca- 
tion of quasar hosts in a merger driven model. At the end of a merger, a 
bright quasar is triggered in a spheroid/merger remnant at the top of the blue 
cloud (owing to the young stellar populations from pre-merger and merger- 
induced star formation), and subsequently the quasar luminosity decays while 
the remnant rapidly reddens, leaving a relatively low accretion rate remnant 
on the red sequence. Middle: Same, but for a secular triggering scenario in 
which quasar activity (which must still require cold gas) is uncorrelated with 
quenching or itself exhausts the gas supply. In this case, quasars should live 
in the blue cloud, with gas-rich systems, and their abundance rapidly drops 
approaching the "green valley" as gas supplies are exhausted. Bottom: We 
compare to observat ions of quasar host galaxy colors at z ~ 0.7 -1.1 from 
Sanchez et al. (2004, blue circles). X-ray identified AGN and quasar hosts 
from INandra et alJ (2006, orange diamonds) are also shown (the numbers 
plotted should not be taken literally, as we have rescaled the authors U—B vs. 
Mb color-magnitude relation to that shown here for the sake of direct compar- 
ison, but the result is qualitatively identical to that shown). Arrows reproduce 
the merger expectation from the top panel. Quasars appear to live in the 
region of color-magnitude space expected if they are triggered at the termi- 
nation of star formation, and subsequently decay in luminosity, as expected 
in merger-driven scenarios. 
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FIG. 25 . — Distribution of quasar host galaxy colors from Figure l24l (histograms; from Sanchez et al. 1 2004) and Nandra et al. ( 2006) in dark blue and orange, 
respectively). We compare with fitted (Gaus sian) color d istributions of blue cloud and red sequence galaxies from Strateva et al. ( 2001), with the distribution of 
colors of barred galaxies in the SDSS from Barazza et al. 1 2006) (the expected quasar hosts in a secular or in stability-driven quasar fueling model), and with the 
fitted (Gaussian) distribution of post-starburst (generally merger remnant) E+A/K+A galaxies in Goto (2005). Quasar host colors follow the "transition" between 
blue cloud and red sequence observed and expected in merger remnants, in contrast to the preferentially most gas-rich, blue hosts of observed strong bars. 

pears to contradict the observations above which find quasars 
to be in preferentially blue spheroids (even X-ray observa- 
tions, which suffer less severe bias against dust-reddened 
systems). A more rigorous quantitative comparison of the 
tracks through color-magnitude space and the relative abun- 
dances in this trans ition region will be the topic of future work 
(IWuyts et al.ll2.Q07l in preparation), and we stress that these 
are all relatively low-redshift samples, but studying how the 
mean quasar luminosity and accretion rates scale/decay with 
the degree of reddening or aging of their host stellar popula- 
tions can provide a powerful discriminant between these mod- 
els. 

There are a number of additional constraints we can place 
on the contribution to the QLF from secular fueling in non- 
merging disks. Figure[26]considers several of these. First, we 
place a limit on secular activity by asking: at a given z, what 
are the brightest QSOs possible in disk/star-forming galax- 
ies? For that redshift, we take the observed mass function of 
star forming galaxies, and convolve with P(Mbh | M ga \) to ob- 
tain the hosted BH mass function (assuming the most massive 
disks are Sa/b-type galaxies). Then, assume that every such 
BH is at its Eddington luminosity. At some point (correspond- 
ing to > 2-4M„ in the disk mass function) the number density 
of these mock quasars falls below the QLF (which declines 
much less rapidly) at that luminosity and redshift. In other 
words, at high luminosities, the required BH masses from the 
Eddington limit are too large to live in late-type galaxies. To 
be optimistic, we assume all the quasar luminosity density be- 
low this limit is contributed by secular activity in disks. This 
then gives an upper limit to the fraction of the luminosity den- 
sity from disks. We repeat this procedure for a number of 
different mass functions at different redshifts. In all cases, 
even this limit falls to a fraction <C 1 by z > 1, as the QLF L* 
reaches large luminosities corresponding to Mbh > 10 s M Q 
BHs at the Eddington limit. Given the BH-host spheroid mass 
relations, this requires a very massive spheroid, easily formed 
in a merger, but not present in even the most early-type disks. 

Second (alternatively), we assume all BHs in pseudobulges 
were formed via secular mechanisms. As discussed in § Q] 
there is good reason to believe that this is the case, whereas 
classical bulges must be formed in mergers. For a given 
z = BH population, we infer an accretion history in the 
standard fashion from m atching the BH mass function and 
continuity equations (e.g. Salu cci et"aLlll999HYu & Tremaind 



2002). We then calculate the fraction of the QLF luminos- 
ity density at a given redshift from systems which, at z = 0, 
live in pseudobulges. We consider this for several differ- 
ent observational estimates of the pseudobulge fraction as 
a function of e.g. host galaxy morphologica l type or bulge 
Sersic index ([K ormendv & Kennicutt 2004; Balcell set alj 
l2004tlAllen et al.ll2006t iNoordermeer & van der Hulstll2007h . 
and the directly e stimated pseudobulge mass functions in 
iDriver et~aT1 d2007l) . Although the details are sensitive to how 
we define pseudobulges, we find a similar result - massive 
BHs which dominate the luminosity density at z > 1 live in 
the most massive bulges/ellipticals, which are overwhelm- 
ingly classical bulges. 

Third, we calculate the probability that the observed clus- 
tering of quasars is consistent with that of star forming/disk 
galaxies (see Figure \15[. This is subject to some important 
caveats - although quasar clustering depends only weakly on 
luminosity (see Figurel20l. galaxy clustering has been shown 
to depend quite stro ngly on galaxy luminosity/stellar mass 
(Norbergetal. 2002). We use t he compilation of clustering 
data from Hop kins et al.l (l2007dl) . as in Figure[l5] At z < 1 .5, 
we specifically compare the clustering of ~ L* quasars with 
that of ~ L* blue/star-forming galaxies. For any model in 
which quasars are driven by secular activity and the statis- 
tics of quasar light curves/triggering are continuous as a func- 
tion of host mass/luminosity (i.e. there is not a second feature 
in the luminosity function introduced by the statistics of the 
light curves themselves), these should roughly correspond. At 
higher redshift, galaxy clustering as a function of type and 
luminosity/mass at ~ is not clearly resolved so we can 
only plot combined clustering of observed star-forming popu- 
lations (generally selected as Lyman-break galaxies); again 
caution is warranted given the known depende nce of clus- 
tering on galaxy mass/luminosity (for LBGs, see lAllen et ail 
2005). Fortunately, the range of particular interest here is 
Z < 1, where we again find a similar trend - quasar cluster- 
ing is consistent with secular fueling a t z ~ 0, but by z ~ 1 
this is no longer true. As discussed in Hopki ns~et al.l (|2007d), 
this appears to be contrary to some previous claims (e.g., 
Adelberger & Steidel 2005); however, in most cases where 
quasars have been seen to cluster similarly to blue galaxies, 
either faint AGN populations (not ~ L* quasars) or bright 
(^> L») blue galaxies were considered. Indeed, quasars do 
cluster in a manner similar to the brightest blue galaxies ob- 
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FIG. 26. — Fraction of the integrated quasar luminosity density owing to 
non-merger driven secular mechanisms. Top: Upper limit to the contribu- 
tion from BHs in disk galaxy hosts at each z (see text). Li mits are derived 
fro m the observed type-separ ated mass functions in Figure [T6l (same style) 
and Franceschini et al. ( 2006, cyan stars). Solid line assumes the disk mass 
function does not evolve with z- Second from Top: Fractional contribution 
from systems in pseudobulges at z = 0. Local distribution of pseudobulge 
mass es is estimated from the observed p seudobulge fraction versus galaxy 
type (Noordermeer & van der Hulst 2007, red dashed line, with ~ la shaded 
range), or assuming all bulges with Sersic in dex n < 2 are pseud obulges (with 
the distribution of n versus bulge mass from Balcells et al. 2004, black solid 
line and shading), or from directly measured pseudobulge mass functions 
(Driver et al. 2007, blue long-dashed line and shading). Second from Bot- 
to m: P robability (from \ 2 ) that observed clustering of quasars (data in Fig- 
ure llSl and star-forming galaxies refl ect the same hosts. Sol id line is derived 
from the best-fit to the compilation of Hopkins et a l. ( 2007d) points from the 
individual measurements included (see Figure |l5> . Bottom: Predicted frac- 
tion of the luminosi ty density from the the model for secular fueling from 
Hopkins & Hernquist 12006), when combined with the merger-driven model 
herein. 



serve d at several redshifts (e.g., ICoil et af]|2007t lAllen et all 
l200l at z ~ 1 and z > 2, respectively). This should not be sur- 
prising; since quasars require some cold gas supply for their 
fueling, they cannot be significantly more clustered than the 
most highly clustered (most luminous) population of galaxies 
with that cold gas. 

Finally, we compare these with a simple model expectation. 
We combin e our prediction of the merger -driven QLF with the 
model from Ho pkins & Hernq uist ( 2006) for the QLF driven 
by secular fueling mechanisms in star-forming galaxies. This 
prediction is based on a simple model of feedback-driven self- 
regulation, calculating the rate of triggering in non-merging 
disks from the observed statistics of gas properties in the cen- 
tral regions of star-forming galaxies of different types. The 
result is similar to the empirical constraints. 

All of these comparisons have important caveats. For exam- 
ple, secular mechanisms could act so quickly as to completely 
transform disks to bulges, rapidly making very large BHs (al- 
though this conflicts with the pseudobulge constraints) from 
disk hosts. Pseudobulges could form in more systems than we 
estimated, but be subsequently transformed to classical bulges 
via major mergers. Clustering could be affected by a number 
of systematic uncertainties inherent in e.g. the mass and lu- 
minosity ranges considered. However, these systematics are 
independent, and there is no single loophole which can si- 
multaneously reconcile the three constraints considered here 
with the possibility that secular fueling dominates bright ~ L* 
quasar activity at z > 1 . Although there are differences in de- 
tail, all the methods we have considered empirically suggest 
a similar scenario: secular (non-major merger related) fuel- 
ing mechanisms contribute little to quasar activity at z > 1, 
which involves the most massive M B h > 10 8 M Q BHs in the 
most massive spheroids. By z ~ 0.5, however, the most mas- 
sive BHs are no longer active, and a significant fraction of the 
quasar luminosity density can come from ~ 10 7 M Q BHs in 
undisturbed hosts. By z ~ 0, the local QLF is largely dom- 
inated by Seyfert activity in relatively small BHs with late- 
type, undisturbed host disks (Heckman et al. 2004). 

Even if we ignore these constraints, a model in which secu- 
lar fueling dominates the growth of quasars and BHs has dif- 
ficulty matching the observed rise and fall of the quasar lumi- 
nosity density with cosmic time. Figure|27]illustrates this. We 
show the observed bolometric quasar luminosity density as a 
function of redshift, compared to our estimate of the merger- 
driven luminosity density (as in Figure[T3l). We also show our 
estimate of the luminosity density which comes from systems 
which, at z = 0, live in pseudobulges, calculated as in Fig- 
ure [26] Again, this fairly moderate, empirical model of secu- 
lar activity can account for the observed luminosity density at 
low redshifts z < 0.5, but provides only a small contribution 
at high redshifts z > 1 . 

We might, however, imagine a "maximal" secular model 
in which all spheroids are initially formed by disk instabili- 
ties. Equivalently (for our purposes), albeit highly contrived, 
a model might invoke secular processes to rapidly build up 
BH mass (to the final mass that will be given by the "future" 
Mbh - c relation) before a spheroid is formed in later merg- 
ers and/or instabilities. These have severe difficulty recon- 
ciling with the kinematics of observed classical bulges (see 
§ [T]) and the tightness of the BH-host spheroid correlations, 
respectively, an d are not favored by simple dynamical argu- 
ments (see, e.g. IShen et alJl2003h . nor the constraints in Fig- 
ure [26j but they could in principle be inv oked. In fact, the 
semi-analytic model of lBower et alJ (120061) is effectively such 
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FIG. 27. — Bolometric quasar luminosity density as a function of redshift. 
Black stars show the observations from Hopkins et al. 1 2007e). Lines show 
estimates from different m ode ls (as labeled): the prediction from a merger- 
driven model (as in Figure [13) and a moderate secular model in which BHs 
in pseudobulges at z = were formed in disk instabilities (as in Figure 1261 
line in same style) are in good agreement with the luminosity density evo- 
lution and empirical constraints on clustering, host galaxy colors, spheroid 
kinematics, and disk/spheroi d mass functions. We compare a maximal secu- 
lar model, from Bower et al. 12006), in which most BHs and (even classical) 
spheroids are initially formed via disk instabilities, and an "extreme" secular 
model, in which all z = BH mass is formed in such instabilities (same as 
the maximal secular model, but with no BH growth from cooling, accretion, 
or mergers; this is unphysical but serves as a strong upper limit). In order for 
disk instabilities to dominate BH growth, they must act very rapidly, before 
the (inevitable) major mergers can exhaust gas and form massive spheroids 
- this forces such models to predict a luminosity density history offset to 
earlier times (higher redshifts) compared to the merger-driven model, in dis- 
agreement with the observations. 



a scenario, in which a very strong disk instability mode is an- 
alytically adopted, which overwhelmingly dominates initial 
bulge formation and BH growth (mergers contributing <C 1 % 
at all redshifts). We therefore compare their estimate for the 
total quasar luminosity density (accretio n rate density) as a 
function of time. Finally, in the default iBower et all (2006) 
model, there is still some growth of BHs via accretion from 
the diffuse ISM, cooling, and mergers (major and minor). We 
therefore also adopt an e ven more ex t reme secular model, in 
which we reproduce the lBower et alj (|2006) analysis with an 
even stronger disk instability mode - essentially renormaliz- 
ing the model such that all z = bulge mass was formed in 
this "secular" mode (i.e. we allow no subsequent growth via 
other mechanisms, and demand that the observed integrated 
Z = BH mass density be matched by the integrated secular 
mode growth). This latter model is of course unphysical, but 
yields a hard upper limit to secular-mode growth. 

It is immediately clear that the "maximal" secular model 
predicts that the quasar luminosity density should peak at 
much higher redshifts z ~ 4 than the observed z ~ 2. In gen- 
eral, the rise and fall of the quasar luminosity density in such a 
model are offset to earlier times. The reason for this is simple: 
in a fully cosmological model, mergers are inevitable. And, 
whether or not most quasars are triggered by mergers, it is ex- 
tremely difficult to contrive a major, gas-rich merger without 
BH accretion and spheroid formation, with most of the gas be- 
ing consumed by star formation. The only way that a secular 
or disk instability model can dominate the integrated buildup 
of BH mass and quasar luminosity density is to "beat merg- 
ers to the finish," i.e. to generally operate early and rapidly 
enough such that the BHs have been largely formed, and gas 
already exhausted, by the time massive galaxies undergo their 



first major mergers. In such models, then, one is forced to 
predict that the quasar luminosity density peaks at very early 
times and has largely declined (i.e. most of the gas in massive 
systems has already been exhausted) by z ~ 2. 

Finally, this relates to a more general point. The quasar lu- 
minosity density (and especially the number density of bright 
quasars corr esponding to > 10 8 M^ BHs at hig h Edding- 
ton ratio; see iFan et al1l2004t [Richards et al.ll2006bl) declines 
rapidly at z > 2-3 (roughly as ~ (1 +z) 4 " 6 ), compared to 
the global star formation rate density of the Universe, which 
is relatively flat at these redshifts (declinin g as ~ (1 +z)°~ 15 
from z ~ 2-6; Hopkin s~& Beacoml l2006h . This has long 
been recognized, and cited as a reason why quasars and BH 
growth cannot explain reionization at high redshifts (since, 
similar to the global star formation history, the UV back- 
ground declines slowly at these redshifts). It further implies 
that BH growth (at least at the masses of interest for our pre- 
dictions here) cannot generically trace star formation. This 
places strong constraints on secular models, as above, as well 
as models in which essentially all high-redshift star forma- 
tion is in bulges or some sort of d issipational collapse (e.g. 
iGranato eT al. 2004; Lapi et al. 2006). Some process must de- 
lay the formation of massive BHs, while allowing star and 
galaxy formation to proceed efficiently at high redshifts. A 
natural explanation is that massive BH formation requires ma- 
jor mergers. In our model, at high redshifts, low-mass galax- 
ies can efficiently form (and potentially build low-mass BHs 
via secular instabilities), but they are predominantly disks, 
which efficiently turn gas into stars and do not form very 
massive bulges or BHs. Only later, once their hosts have 
grown more massive, are they likely to undergo major merg- 
ers, which transform the disks into spheroids and build cor- 
respondingly massive BHs. This automatically explains the 
much sharper rise and fall of the quasar luminosity density 
and number density of bright quasars, relative to the shallow 
evolution in the star formation rate density and ionizing back- 
ground of the Universe at high redshifts. 

4. DISCUSSION 

We have developed a theoretical model for the cosmolog- 
ical role of galaxy mergers, which allows us to make pre- 
dictions for various merger-related populations such as star- 
bursts, quasars, and spheroidal galaxies. By combining the- 
oretically well-constrained halo and subhalo mass functions 
as a function of redshift and environment with empirical halo 
occupation models, we can estimate where galaxies of given 
properties live at a given epoch. This allows us to calculate, in 
an a priori cosmological manner, where major galaxy-galaxy 
mergers occur and what kinds of galaxies merge, at all red- 
shifts. 

We compare these estimates to a number of observations, 
including observed merger mass functions; merger fractions 
as a function of galaxy mass, halo mass, and redshift; the mass 
flux/mass density in mergers; the large-scale clustering/bias 
of merger populations; and the small-scale environments of 
mergers, and show that this approach yields robust predictions 
in good agreement with observations, and can be extended to 
predict detailed properties of mergers at all masses and red- 
shifts. There are some uncertainties in this approach. How- 
ever, we re-calculate all of our predictions adopting different 
estimates for the subhalo mass functions and halo occupation 
model (and its redshift evolution) and find this makes little dif- 
ference (a factor < 2) at all redshifts. The largest uncertainty 
comes from our calculation of merger timescales, where, at 
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the highest redshifts (z > 3), merging via direct collisional 
processes might be more efficient than merging via dynami- 
cal friction, given the large physical densities. More detailed 
study in very high-resolution numerical simulations will be 
necessary to determine the effective breakdown between dif- 
ferent merger processes. Nevertheless, the difference in our 
predictions at these redshifts is still within the range of obser- 
vational uncertainty. Ultimately, we find that our predictions 
are robust above masses M ga ] > 10 10 M Q , regardless of these 
possible changes to our model, as the theoretical subhalo mass 
functions and empirical halo occupation models are reason- 
ably well-constrained in this regime. 

In addition to these specific observational predictions and 
tests, our model allows us to examine the physical origins of 
the distribution of major mergers of different galaxy masses 
and types. For example, there is a naturally defined major- 
merger scale (host halo mass Mhaio) for galaxies of mass M„. d \ 
- the "small group scale," only slightly larger than the av- 
erage halo hosting a galaxy of mass M ga i. This is the scale 
at which the probability to accrete a second galaxy of com- 
parable mass ~ Mg a i (fuel for a major merger) first becomes 
significant. At smaller (relative) halo masses, the probability 
that the halo hosts a galaxy as large as M ga \ declines rapidly. 
At larger masses, the probability that the halo will merge with 
or accrete another halo hosting a comparable ~ M„ a i galaxy 
increases, but the efficiency of the merger of these galax- 
ies declines rapidly. We stress that this small group scale is 
indeed small - the average small group halo will still host 
only 1 galaxy of mass ~ M ga i, and groups will only consist 
of 2 - 3 members of similar mass. We also note that this 
does not mean that mergers occur (in a global sense) on a 
specific scale, since the small group scale is different for dif- 
ferent galaxy masses. In fact, a consequence of this model is 
that mergers occur in halos of all masses and in all environ- 
ments ( including field and even void environments), as is ob - 
served (ISol Alonso et al.ll2006l:lGotol2005l:lHogg et al.ll2006l) . 
although the characteristic masses and star formation histo- 
ries of galaxies merging may reflect their different environ- 
ments/halo masses. Similarly, our model allows us to accu- 
rately predict and understand the (relatively weak) evolution 
of the merger fraction with redshift, and the relative evolution 
in merger rates as a function of mass (evolution of the ma- 
jor merger mass functions). The clustering properties and de- 
pendence of merger rates on both large-scale and small-scale 
environment are natural consequences of the fundamentally 
local nature of mergers, and we study in detail the effects of 
environment on merger rates as a function of scale. 

Having characterized mergers in this way, we examine 
the role that mergers play in triggering quasars. Even if 
there are other quasar "triggers" dominant at some luminosi- 
ties/redshifts, it is difficult to imagine a scenario in which 
the strong nuclear gas inflows from a merger do not cause 
rapid, near Eddington-limited accretion and ultimately yield 
some kind of quasar - and indeed such activity is ubiquitous 
in late - stage mergers (iKomossa et al J 1 20031: lAlexander et al.1 
I2005at iBorvs et alj|2005t iBrand et al.ll2006l) . We therefore 
make the simple ansatz that gas-rich, major mergers will pro- 
duce quasars (but do, in principle, allow for other fueling 
mechanisms as well). This model, with just the contribution 
of mergers to the quasar luminosity density, is able to account 
for the observed quasar luminosity density from z = 0-6. The 
rise and fall of the luminosity density with redshift, as well as 
the shape and evolution of the quasar luminosity function, are 
accurately reproduced. This also yields predictions of the lo- 



cal black hole mass f unction, cosmic X-ray background (see 
Hopkins et al. 2006a), AGN fractions as a function of galaxy 
mass/luminosity and redshift, large scale quasar clustering as 
a function of luminosity and redshift, small-scale quasar clus- 
tering excesses, quasar host galaxy colors, and infrared lumi- 
nosity functions, all in good agreement with those observed. 
In particular, matching the history of the bolometric luminos- 
ity density of quasars requires no knowledge or assumptions 
about quasar duty cycles, light curves, or lifetimes, only our 
determination of the global mass density in gas-rich major 
mergers. 

In our model, the sharp rise and fall of the quasar lumi- 
nosity density over cosmic time is the product of several fac- 
tors. At high redshifts, the buildup of BH mass from z > 6 to 
z ~ 2 owes in part to the growth of galaxy and halo mass, as 
most galaxies are rapidly forming, and the galaxy mass den- 
sity involved in major mergers steadily increases with time. 
The rise is steeper than that in, for example, the global star 
formation rate density of the Universe, as it tracks just the 
major merger history (effectively, at these redshifts, the rise 
in the density of relatively massive "small group" sized ha- 
los), as opposed to the global buildup of the (relatively lower- 
mass) halos hosting the most rapidly star-forming galaxies. 
Below redshift z ~ 2, merger rates begin to decline for all 
galaxies, and the exhaustion of gas in evolved systems slows 
the growth of quasars in two ways. First, major mergers 
of relatively gas-poor disks create shallower central poten- 
tial wells for the remnant spheroid (i.e. lower a values), and 
as a consequence BH growth self-regulates at lower masses 
(Hopk ins et al.ll2007cT) . in agreement with the obse rved evolu- 
tion o f the BH-host correlations with redshift (e.g. JPeng et alj 

2006) . Second, an increasing fraction of galaxies (especially 
around ~ L*, where most of the mass density resides) have 
already undergone major mergers and exist as "quenched" 
spheroids (with very little remaining cold, rotationally sup- 
ported gas) whose major mergers will not excite quasar activ- 
ity. Recent high-resolution cosmological simulations which 
attempt to resolve the r elevant merger and feedback effects 
regul ating BH growth dSiiacki et all 120071: iDi Matteo etaT] 

2007) further support this scenario, with the combination of 
these effects and, primarily, the merger history of the Universe 
regulating BH growth (at least at redshifts z < 6). The product 
of these effects yields the observed steep rise and fall of the 
quasar population with respect to its peak at z ~ 2, in good 
agreement with the observations and in contrast with the sub- 
stantially more extended global star formation history of the 
Universe. 

We compare this model to one in which quasar fueling 
is primarily driven by secular processes - i.e. disk instabili- 
ties, bars, harassment, or any process which operates in non- 
merging, gas-rich systems. We demonstrate that there are a 
number of robust, qualitatively distinct predictions from these 
models, including: 

Quasar Clustering: A merger-driven model accurately pre- 
dicts the observed large-scale clustering of quasars (both 
at ~ L* and as a detailed function of luminosity) as a 
function of redshift for the observed range z ~ 0.5 - 4. 
The clustering is, at all these redshifts, precisely that 
predicted for "small group" halos in which major merg- 
ers of gas-rich galaxies should proceed most efficiently. 
It is well-established empirically th at quasar clustering 
traces a characteristic host halo mass dPorciani et al.l 120041: 
Wake et al.l 12004 ICroom et alj l2005t iPorciani & Norberd 
20061 iMyers et alJ l2006at Ida Angela et all 120061: ICoil et alJ 
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l2007t IShen et all l2007t iHopkins et al.1 l2007dh . and inves- 
tigations o f the quasar proximity effe ct reach a similar 
conclusion dFaucher-Giguere et alJ|2007l: iKim & Croftll2007l: 
Nascimento Guimaraes et al. 2007). Comparing this to inde- 
pendent, direct measurements of the small group scale of ~ L* 
gas-rich galaxies, and to the small group scale inferred from a 
wide variety of different halo occupation models, we show in 
all cases that these trace the same mass. In contrast, the clus- 
tering of typical star-forming galaxies is somewhat weaker (as 
expected relative to their small group scale), and yields an 
underestimate of quasar clustering at moderate and high red- 
shifts. Only at low redshifts (z < 0.5) is there reasonable con- 
sistency between the clustering o f ~ quasars and "se cular" 
populations (for more details, see IHopkins et af]|2007dl) . 

Small-Scale Environments: Mergers will preferentially oc- 
cur in environments with an overdensity of galaxies on small 
scales, and as a consequence their clustering should reflect a 
bias (relative to a mean galaxy of the same mass) to excess 
clustering on small scales. Furthermore, triggering of binary 
quasars in (even a small fraction of) early interacting pairs 
can enhance this excess. Indeed, in a purely e mpirical sense, 
both bright quasars at all redshifts z ~ 0-5 - 3 dHennawi et al.l 
I2006t ISerber et al]l2006t iMvers et alj|2006bh a nd local post- 
starburst merger remnant galaxies ~(lGotoll2005h are observed 
to have similar, strong excess clustering on small scales, dis- 
tinct from quiescent (non-merger related) populations. This 
is true both in terms of the quasar-quasar autocorrelation, 
and for the quasar-galaxy cross-correlation, suggesting that 
it reflects a true tendency for quasars to reside in regions of 
small-scale overdensity. Our model predicts the magnitude of 
this excess clustering as a function of physical scale and red- 
shift well for both populations. Interestingly, low-luminosity 
Seyfert galaxies {Mb > -23) are observe d without such an ex- 
cess on small scales ( Ser ber et al.l l2006). as expected if AGN 
triggering at low luminosities (or typical Mbh < 10 7 M Q ) is 
dominated by secular processes (with the true quasar popula- 
tions dominated by mergers). However, systems of these low 
luminosities contribute significantly to the quasar luminosity 
density at only very low redshifts z < 0.5, once more massive 
systems have predominantly quenched. 

Host Galaxy Colors: The stellar population colors of a 
gas-rich merger remnant will rapidly redden, at least over 
the ~Gyr period over which subsequent infall or cooling 
can be ignored, and the system will (even if only temporar- 
ily) cross the "green valley" between the blue cloud and red 
sequence. If a quasar is triggered at the end of a merger, 
the decay of the quasar lightcurve should be associated with 
the host crossing this interval, or equivalently with the pres- 
ence of a relatively young, blue host spheroid. Observed 
quasar hosts at z ~ 0.5- 1.1 appear to preferentially occupy 
this (o therwise relatively empty) locus in color -magnitude 
space dSanchez et all l2004t iNandra et all [2006), and it is 
well-established that bright quasar hosts tend to be massive 
spheroids with espec ially young stellar or post-starburst stel- 
lar populations (e.g.lCanalizo & Stocktqnll200lHJahnke et alJ 
l2004aHVanden Berk et al.ll2006tlBarthelll2006l and references 
therein). We show that the color distribution of observed 
quasar hosts is similar to that observed for clear post-starburst 
merger remnant populations. In contrast, a secular model 
(regardless of the quasar duty cycle or lifetime) would pre- 
dict that quasar hosts trace the population of systems host- 
ing strong disk instabilities or bars (unless any quasar activity 
could somehow be suppressed over the entire lifetime of a 
relatively long-lived bar) - these actually tend to be the most 



blue, gas-rich disk galaxies. We show that the observed col- 
ors of quasar hosts are distinct from those of systems observed 
hosting strong bars. 

Host Kinematics (Pseudobulges versus Classical Bulges): 
Numerical simulations and observations of both barred sys- 
tems and merger remnants have established that mergers yield 
systems with the observed kinematic and photometric prop- 
erties of classical bulges, whereas secular disk instabilities 
generically give rise to pseudobulges with distinct properties 
(see the discussion in §[T|). At high redshifts z > 1, the ac- 
tive ~ quasar populations (either from direct quasar BH 
mass measurements or simply the Eddington argument) are 
dominated by massive BHs (Mbh ^ 10 8 M Q ), which are di- 
rectly observed t o live in massive bulges at those redshifts 
(Peng et al. 2006), and whose remnants clearly live in massive 
bulges locally. These spheroids (M sp h > 10 n M Q ) are over- 
whelmingly classical spheroids (in particular, classical true 
ellipticals), whose kinematics argue that they were formed in 
mergers. To the extent that the buildup of BH mass traces 
spheroid origin (true at all redshifts observed, albeit with po- 
tentially redshift-dependent efficiency), this implies forma- 
tion in mergers. Adopting a number of different estimates 
of e.g. the pseudobulge fraction as a function of host prop- 
erties, pseudobulge mass distributions, or simply assuming 
all bulges in star-forming/disk-dominated galaxies are formed 
via secular instabilities, we compare with the distribution of 
active BH masses in the quasar luminosity function at all red- 
shifts, and show that these populations cannot dominate the 
QLF at redshifts z > 1 . Only at low redshifts z < 1 are the 
global QLF and buildup of BH mass occurring mainly in sys- 
tems which typically reside in star-forming, disk-dominated 
hosts with pseudobulges potentially formed via disk instabili- 
ties or bars. 

Quasar Luminosity Density versus Redshift: As noted 
above, a merger-driven model predicts a sharp rise and fall 
of the quasar luminosity density in good agreement with ob- 
servations. If, for the sake of argument, we adopt a model 
in which all BH growth is driven by disk instabilities, we 
demonstrate that, once embedded in a proper cosmological 
context, such a model is generically forced to predict a his- 
tory of quasar luminosity density which is offset to earlier 
times (in each of its rise, peak, and fall), in conflict with 
the observations. This is because major mergers are dynami- 
cally inevitable - one cannot simply "remove" the mergers a 
galaxy will undergo in a true cosmological model. In order 
for disk instabilities to dominate BH growth or spheroid for- 
mation, they must, therefore, act before massive systems un- 
dergo their major mergers. Since the global mass flux in gas- 
rich major mergers peaks around z ~ 2-3, a secular-dominant 
model is forced to assume a sufficiently strong disk instabil- 
ity mode such that the progenitors of these systems rapidly 
exhaust their gas supplies and build up most of their final 
BH/spheroid mass at redshifts z > 4. By z ~ 2, then, these 
models predict the quasar luminosity density is already in 
rapid decline. We demonstrat e this both for cur rent state-of- 
the-art semi-analytic models dBower et alj|2006l) . constrained 
such that they cannot overproduce the z = mass density in 
quenched systems nor "avoid" major mergers, and simple il- 
lustrative toy models. The only way to avoid this is to weaken 
the disk instability criterion - i.e. to assume disk instabilities 
are not so efficient at exhausting systems, and can therefore 
act continuously over longer times. But then, one obtains a 
prediction similar to our expectation from assuming all pseu- 
dobulges are formed in disk instabilities - namely, the high 
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rate of gas-rich mergers at high redshifts will dominate quasar 
activity at all z > 1, and this "gentler" disk instability mode 
will dominate at lower luminosities (i.e. only dominate BH 
mass buildup at low masses Mbh < 10 7 M Q ), becoming im- 
portant to the total luminosity density only at z < 1 . 

These comparisons, despite the very different possible sys- 
tematic effects in the observations, all suggest a similar sce- 
nario. Secular (non-merger related) fueling mechanisms may 
dominate AGN activity in low-BH mass systems (M BH < 
10 7 M Q ), for which mergers are relatively rare and hosts tend 
to be very gas-rich, potentially bar-unstable disks, but these 
contribute little to quasar activity at z > 1, which involves 
the most massive Mbh ^ 10 8 M Q BHs in the most massive 
spheroids. By z ~ 0.5, however, the most massive BHs are no 
longer active (their hosts having primarily been gas exhausted 
and quenched, and with overall merger rates declining), and a 
significant fraction of the AGN luminosity density can come 
from ~ 10 7 M Q BHs in undisturbed hosts, corresponding to 
relatively low-luminosity (Mb > -23) Seyfert galaxies. By 
z ~ 0, the local QLF is largely dominated by Seyfert activ- 
ity in relatively small BHs with late-type, undisturbed host 
disks dHeckman et al.l 120041) . Our models allow for secu- 
lar mechanisms, such as th e stochastic triggering model of 
Hopk ins & Hernquistl(l2006l) . to be important at low luminosi- 
ties, and a pure comparison between this secular model and 
our merger-driven prediction here yields a transition to secu- 
lar dominance at low luminosities in good agreement with the 
empirical constraints. 

Ultimately, one would like to test this by directly studying 
the morphology of true, bright quasar hosts at high redshifts. 
Unfortunately, as discussed in § Q] this remains extremely 
difficult, and results have been ambiguous. As noted previ- 
ously, mock observa tions constructed f rom numerical major 
merger simulations (Krause et al. 2007) imply that, with the 
best presently attainable data, the faint, rapidly fading tidal 
features associated with the quasar phase (i.e. final stages 
of the merger, at which the spheroid is largely formed and 
has begun to relax) are difficult to observe even locally and 
(for now) nearly impossible to identify at the redshifts of 
greatest interest (z > 1). Similarly, experiment s with au- 
tomat ed, non-parametric classification schemes (lLotz et al.1 
2004) suggest that the hosts will generically be classified 
as "normal" spheroids, even with perfect resolution and no 
surface bri ghtness dimm i ng. T his appears to be borne out, 
as recently Bennert et al. (2007) have re-examined very low- 
redshift quasars previously recognized from deep HST imag- 
ing as having relaxed spheroid hosts, and found (after consid- 
erably deeper integrations) that every such object shows clear 
evidence for a recent merger. The ability to identify such fea- 



tures may be slightly improved if one considers just the popu- 
lation of highly dust-reddened (but still dominated by quasar 
light in the optical/near IR) or IR-luminous quasar expected 
to be associated with a (brief) "blowout" stage preceding the 
more typical optical quasar phase in a merger, and it does 
appear that observations of quasars in this stage, somewhat 
closer to the peak of merger activity, show ubiquitous evi- 
dence of recent or ongoing mergers (Hutchingsetal. 2003, 
1 20061: iKawakatu et al.ll2006t iGuyon et alJl2006t : lUrrutia et atl 
120071) . albeit still requiring very deep integrations. 

On the other hand, it is increasingly possible to improve the 
constraints we have studied in this paper, to break the degen- 
eracy between secular and merger-driven models of quasar 
fueling. Improving measurements of merger fractions, mass 
functions, and clustering at low redshifts, and extending these 
measurements to high redshifts, can break the degeneracies in 
our cosmological models (regarding, for example, the appro- 
priate merger timescales at high redshifts) and enable more 
robust, tightly constrained predictions. We have also made 
a large number of predictions in this paper an d previous re- 
lated works (e.g. iHopkins et al1l2006al l2007dl) which can be 
directly tested without the large ambiguities presently inher- 
ent in quasar host morphology estimates. Better observations 
of quasar host galaxy colors (and corresponding estimates of 
their recent star formation history), improved measurements 
of quasar clustering at redshifts z > 3 (especially measure- 
ments which can resolve ~ L* quasars at these redshifts), de- 
tailed cross-correlation measurements of quasars and other 
galaxy populations and clustering measurements which can 
decompose the excess bias of quasars on small scales as a 
function of e.g. redshift and luminosity, improved constraints 
on the bolometric corrections of the brightest quasars and 
the history of the bolometric quasar luminosity density at 
z > 3-4, and estimates of the evolution with redshift of pseu- 
dobulge populations will all be able to test the models pre- 
sented in this paper. The combination of these observations 
can greatly strengthen the constraints herein, and ultimately 
allow for more detailed modeling which attempts not just to 
predict the general origin of quasars in mergers, but to fully 
break down the contribution of major mergers (or mergers of 
different types) and other fueling mechanisms to the quasar 
luminosity functions as a function of luminosity and redshift. 
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